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NOVEL CARBOXYLESTERASE NUCLEIC ACID MOLECULES, 
PROTEINS AND USES THEREOF 



FIELD OF THE INVENTION 



The present invention relates to arthropod esterase nucleic acid molecules, 



5 proteins encoded by such nucleic acid molecules, antibodies raised against such proteins, 
and inhibitors of such proteins. The present invention also includes therapeutic 
compositions comprising such nucleic acid molecules, proteins, antibodies, and/or other 
inhibitors, as well as their use to protect an animal from hematophagous arthropod 
infestation. 



Hematophagous arthropod infestation of animals is a health and economic 
concern because hematophagous arthropods are known to cause and/or transmit a variety 
of diseases. Hematophagous arthropods directly cause a variety of diseases, including 
allergies, and also carry a variety of infectious agents including, but not limited to, 

15 endoparasites (e.g., nematodes, cestodes, trematodes and protozoa), bacteria and viruses. 
In particular, the bites of hematophagous arthropods are a problem for animals 
maintained as pets because the infestation becomes a source of annoyance not only for 
the pet but also for the pet owner who may find his or her home generally contaminated 
with insects. As such, hematophagous arthropods are a problem not only when they are 

20 on an animal but also when they are in the general environment of the animal. 

Bites from hematophagous arthropods are a particular problem because they not 
only can lead to disease transmission but also can cause a hypersensitive response in 
animals which is manifested as disease. For example, bites from fleas can cause an 
allergic disease called flea allergic (or allergy) dermatitis (FAD). A hypersensitive 

25 response in animals typically results in localired tissue inflammation and damage, 
causing substantial discomfort to the animal. 

The medical importance of arthropod infestation has prompted the development 
of reagents capable of controlling arthropod infestation. Commonly encountered 
methods to control arthropod infestation are generally focused on use of insecticides. 

30 While some of these products are efficacious, m&st, at best, offer protection of a very 
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limited duration. Furthermore, many of the methods are often not successful in reducing 
arthropod populations. In particular, insecticides have been used to prevent 
hematophagous arthropod infestation of animals by adding such insecticides to 
shampoos, powders, collars, sprays, foggers and liquid bath treatments (i.e., dips). 
5 Reduction of hematophagous arthropod infestation on the pet has been unsuccessful for 
one or more of the following reasons: (1) failure of owner compliance (frequent 
administration is required); (2) behavioral or physiological intolerance of the pet to the 
pesticide product or means of administration; and (3) the emergence of hematophagous 
arthropod populations resistant to the prescribed dose of pesticide. However, 
10 hematophagous arthropod populations have been found to become resistant to 
insecticides. 

Prior investigators have described insect carboxylesterase (CE) protein 
biochemistry, for example, Chen et al., Insect Biochem. Molec. BioL, 24:347-355, 1994; 
Whyard et al., Biochemical Genetics, 32:924, 1994 and Argentine et al., Insect Biochem. 

15 Molec Biol, 25:621-630, 1995. Other investigators have disclosed certain insect CE 
amino acid sequences, for example, Mouches et al., Proc Natl Acad Sci USA, 87:2574- 
2578, 1990 and Cooke et al., Proc Natl Acad Sci USA, 86:1426-1430, 1989, and nucleic 
acid sequence (Vaughn et al., J. Biol. Chem., 270:17044-17049, 1995). 

Prior investigators have described certain insect juvenile hormone esterase (JHE) 

20 nucleic acid and amino acid sequences: for example, sequence for Heliothis virescens is 
disclosed by Hanzlik et al., J. Biol. Chem., 264:12419-12425, 1989; Eldridge et al., App 
Environ Microbiol, 58:1583-1591, 1992; Bonning et al., Insect Biochem. Molec. BioL, 
22:453-458, 1992; Bonning et al., Natural and Engineered Pest Management Agents, pp. 
368-383, 1994 and Harshman et al., Insect Biochem. Molec. Biol, 24:671-676, 1994 ; 

25 sequence fcr P anduca sexta : s disclosed by Vankatesh et al., J Biol Chem, 265:21727- 
21732, 1990; sequence for Trichoplusia m is disclosed by Venkataraman et al., Dev. 
Genet., 15:391-400, 1994 and Jones et al., Biochem. J., 302:827-S35, 1994; and 
sequence for Lyman tria dispar is disclosed by Valaitis, Insect Biochem. Molec. BioL, 
22:639-648, 1992. 
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Identification of an esterase of the present invention is unexpected, however, 
because even the most similar nucleic acid sequence identified by previous investigators 
could not be used to identify an esterase of the present invention. In addition, 
identification of an esterase protein of the present invention is unexpected because a 
protein fraction from flea prepupal larvae that was obtained by monitoring for serine 
protease activity surprisingly also contained esterase proteins of the present invention. 

In summary, there remains a need to develop a reagent and a method to protect 
animals or plants from hematophagous arthropod infestation. 

SUMMARY OF THE INVENTION 
The present invention relates to a novel product and process for protection of 
animals or plants from arthropod infestation. According to the present invention there 
are provided arthropod esterase proteins and mimetopes thereof; arthropod nucleic acid 
molecules, including those that encode such proteins; antibodies raised against such 
esterase proteins (i.e., anti-arthropod esterase antibodies); and compounds that inhibit 
arthropod esterase activity (i.e, inhibitory compounds or inhibitors). 

The present invention also includes methods to obtain such proteins, mimetopes, 
nucleic acid molecules, antibodies and inhibitory compounds. Also included in the 
present invention are therapeutic compositions comprising such proteins, mimetopes, 
nucleic acid molecules, antibodies, and/or inhibitory compounds, as well as use of such 
therapeutic compositions to protect animals from arthropod infestation. 

Identification of an esterase of the present invention is unexpected, however, 
because the most similar nucleic acid sequence identified by previous investigators could 
not be used to identify an esterase of the present invention. In addition, identification of 
an esterase protein of the present invention is unexpected because a protein fraction from 
flea prepupal larvae that was obtained by uionitoring for serine protease activity 
surprisingly also contained esterase proteins of the present invention. 

One embodiment of the present invention is an isolated nucleic acid molecule 
that hybridizes under stringent hybridization conditions with a gene comprising a nucleic 
acid sequence including SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
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NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
5 NO:51, SEQ ID NO:52, SEQ ID NO:57, SEQ ID NO:59, SEQ ID NO:60, SEQ ID 
NO:61, SEQ ID NO:67, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID 
NO:72, SEQ ID NO:76 and/or a nucleic acid molecule encoding a protein comprising 
amino acid sequence SEQ ID NO:74. 

The present invention also includes a nucleic acid molecule that hybridizes under 

10 stringent hybridization conditions with a nucleic acid molecule encoding a protein 

comprising at least one of the following amino acid sequences:SEQ ID NO:2, SEQ ID 
NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NO:14, SEQ ID NO: 19, SEQ ID NO:25, 
SEQ ID NO:31, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ 
ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:53, SEQ ID NO:54, SEQ ID 

15 NO:55, SEQ ID NO:58, SEQ ID NO:68, SEQ ID NO:73 and/or SEQ ID NO:74; and 

particularly a nucleic acid molecule that hybridizes with a nucleic acid sequence that is a 
complement of a nucleic acid sequence encoding any of the amino acid sequences. A 
preferred nucleic acid molecule of the present invention includes a nucleic acid molecule 
comprising a nucleic acid sequence including SEQ ID NO:l, SEQ ID NO:3, SEQ ID 

20 NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ 
ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID 
NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID 

25 NO:38, SbQ iD NO:51, SEQ ID NO:52, SEQ ID NO:57, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61, SEQ ID NO:67, SEQ ID NO:69, SEQ ID NO:70, SEQ ID 
NO:71, SEQ ID NO:72, SEQ ID NO:76 and/or a nucleic acid molecule encoding a 
protein comprising amino acid sequence SEQ ID NO:74, and allelic variants thereof. 
The present invention also includes an isolated carboxylesterase nucleic acid 

30 molecule comprising a nucleic acid sequence encoding a protein comprising an amino 
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acid sequence including SEQ ID NO:5, SEQ ID NO: 19, SEQ ID NO:39, SEQ ID 
NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44 and/or SEQ ID 
NO:53. SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID 
NO:43 and SEQ ID NO:44 represent N-terminal amino acid sequences of 
5 carboxyl esterases isolated from prepupal flea larvae, the production of which are 
described in the Examples of the present application. 

The present invention also relates to recombinant molecules, recombinant viruses 
and recombinant cells that include a nucleic acid molecule of the present invention. 
Also included are methods to produce such nucleic acid molecules, recombinant 

1 0 molecules, recombinant viruses and recombinant cells. 

Another embodiment of the present invention includes an isolated esterase 
protein that is encoded by a nucleic acid molecule that hybridizes under stringent 
hybridization conditions to (a) a nucleic acid molecule that includes at least one of the 
following nucleic acid sequences:SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:9, SEQ ID 

15 NO:12, SEQ ID NO.T5, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:35, SEQ ID NO:38, SEQ ID 
NO:52, SEQ ID NO:59, SEQ ID NO:61, SEQ ID NO:69, and SEQ ID NO:71; and/or (b) 
a nucleic acid molecule encoding a protein including at least one of the following amino 
acid sequences: SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ 

20 ID NO:43, SEQ ID NO:44, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55 and SEQ 
ID NO:74. One embodiment is a carboxylesterase protein encoded by a nucleic acid 
molecule that hybridizes under stringent hybridization conditions to a nucleic acid 
molecule that encodes a protein comprising at least one of the following amino acid 
sequences: SEQ ID NO:5, SEQ ID NO: 19,' SEQ ID NO:39, SEQ ID NO:40, SEQ ID 

25 NO:41, SEQ ID NO:42, SEQ ID NO 43, SEQ ID NO:44 and/or SEQ ID NO:53. 

Preferred proteins of the present invention are isolated flea proteins including at least 
one of the following amino acid sequences:SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:S, 
SEQ ID NO: 1 1, SEQ ID NO: 14, SEQ ID NO: 19, SEQ ID NO:25, SEQ ID NO:31, SEQ 
ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID 

30 NO:43, SEQ ID NO:44, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID 
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NO:58, SEQ ID NO:68, SEQ ID NO:73 and SEQ ID NO:74; also included are proteins 
encoded by allelic variants of nucleic acid molecules encoding proteins comprising any 
of the above-listed amino acid sequences. 

The present invention also relates to mimetopes of arthropod esterase proteins as 
5 well as to isolated antibodies that selectively bind to arthropod esterase proteins or 
mimetopes thereof. Also included are methods, including recombinant methods, to 
produce proteins, mimetopes and antibodies of the present invention. 

The present invention also includes a formulation of flea carboxylesterase 
proteins, in which the proteins, when submitted to 14% Tris-glycine SDS-PAGE, 
1 0 comprise a fractionation profile as depicted in Fig. 3, in which the proteins have 
carboxylesterase activity. 

Also included in the present invention is a formulation of flea carboxylesterase 
proteins, in which the proteins, when submitted to IEF-PAGE, comprise a fractionation 
profile as depicted in Fig. 4, lane 3, lane 4, lane 5, lane 6 and/or lane 7, wherein the 
15 proteins have carboxylesterase activity. 

Another embodiment of the present invention is an isolated flea protein or a 
formulation of flea proteins that hydrolyzes a-napthyl acetate to produce a-napthol, 
when the protein is incubated in the presence of a-napthyl acetate contained in 20 mM 
Tris at pH 8.0 for about 15 minutes at about 37°C. 
20 Yet another embodiment of the present invention is an isolated flea protein or a 

formulation of flea proteins that hydrolyzes the methyl ester group of juvenile hormone 
to produce a juvenile hormone acid. 

Another embodiment of the present invention is a method to identify a compound 
capable of inhibiting flea carboxylesterase activity, the method comorising: (a) 
25 con'ac'mg an isolated flea carboxylesterase with a putative inhibitory compound under 
conditions in which, in the absence of the compound, the protein has carboxylesterase 
activity; and (b) determining if the putative inhibitory compound inhibits the activity. 
Also included in the present invention is a test kit to identify a compound capable of 
inhibiting flea carboxylesterase activity, the test kit comprising an isolated flea 
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carboxylesterase protein having esterase activity and a means for determining the extent 
of inhibition of the activity in the presence of a putative inhibitory compound. 

Yet another embodiment of the present invention is a therapeutic composition 
that is capable of reducing hematophagous ectoparasite infestation. Such a therapeutic 
composition includes at least one of the following protective compounds: an isolated 
hematophagous ectoparasite carboxylesterase protein or a mimetope thereof, an isolated 
carboxylesterase nucleic acid molecule that hybridizes under stringent hybridization 
conditions with a Oenocephalides felis carboxylesterase gene, an isolated antibody that 
selectively binds to a hematophagous ectoparasite carboxylesterase protein, and an 
inhibitor of carboxylesterase activity identified by its ability to inhibit the activity of a 
flea carboxylesterase. A therapeutic composition of the present invention can also 
include an excipient, an adjuvant and/or a carrier. Preferred esterase nucleic acid 
molecule compounds of the present invention include naked nucleic acid vaccines, 
recombinant virus vaccines and recombinant cell vaccines. Also included in the present 
invention is a method to protect an animal from hematophagous ectoparasite infestation, 
comprising the step of administering to the animal a therapeutic composition of the 
present invention. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 depicts SDS-PAGE analysis of DFP-labeled esterase proteins. 
Fig. 2 depicts carboxylesterase activity of certain esterase proteins of the present 
invention. 

Fig. 3 depicts SDS-PAGE analysis of carboxylesterase activity of certain esterase 
proteins of the present invention. 

Fig. 4 depicts IEF analysis of certain esterase proteins of the present invention. 

Fig. 5 depicts juvenile hormone esterase activity of certain esterase proteins of 
the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides for isolated arthropod esterase proteins, isolated 
arthropod esterase nucleic acid molecules, antibodies directed against arthropod esterase 
proteins and other inhibitors of arthropod esterase activity. As used herein, the terms 
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isolated arthropod esterase proteins and isolated arthropod esterase nucleic acid 
molecules refers to esterase proteins and esterase nucleic acid molecules derived from 
arthropods and, as such, can be obtained from their natural source or can be produced 
using, for example, recombinant nucleic acid technology or chemical synthesis. Also 
5 included in the present invention is the use of these proteins, nucleic acid molecules, 
antibodies and inhibitors as therapeutic compositions to protect animals from 
hematophagous ectoparasite infestation as well as in other applications, such as those 
disclosed below. 

Arthropod esterase proteins and nucleic acid molecules of the present invention 
10 have utility because they represent novel targets for anti-arthropod vaccines and drugs. 
The products and processes of the present invention are advantageous because they 
enable the inhibition of arthropod development, metamorphosis, feeding, digestion and 
reproduction processes that involve esterases. White not being bound by theory, it is 
believed that expression of arthropod esterase proteins are developmental^ regulated, 
15 thereby suggesting that esterase proteins are involved in arthropod development and/or 
reproduction. The present invention is particularly advantageous because the protons of 
the present invention were identified in larval fleas, thereby suggesting the importance of 
the proteins as developmental proteins. 

One embodiment of the present invention is an esterase formulation that includes 
70 one or more esterase proteins capable of binding to diisopropylfluorophosphate (DFP). 
A preferred embodiment of an esterase formulation of the present invention composes 
one or more arthropod esterase proteins that range in molecular weight from about 20 
kilodaltons (kD) to about 200 kD, more preferably from about 40 kD to about 100 kD, 
and even more preferably from about 60 kD to about 75 kD, as determined by SDS- 
25 PAGE (sodium dodecyl sulfate polyacrylamide gel elec^o moresis). An even more 
preferred formulation includes one or more flea esterase proteins having elution (or 

migration) patterns as shown in Fig. 1. 

Another embodiment of the present invention is a formulation comprising one or 
more hematophagous ectoparasite carboxylesterase (CE) proteins. The present invention 
30 includes the discovery that such a formulation has general CE activity. General CE 
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activity can be identified using methods known to those of skill in the art and described 
in the Examples section herein. A suitable formulation of the present invention 
comprises one or more flea proteins capable of hydrolyzing cc-napthyl acetate to produce 
a-napthol when the proteins are incubated in the presence of ex-napthyl acetate contained 
in 20 mm Tris at pH 8.0 for about 15 minutes at about 37°C. General CE activity can be 
identified following such incubation by detecting the production of from about 0.3 to 
about 2.5 absorbance units in the presence of Fast Blue when measured at 590 nm. 

A preferred CE formulation of the present invention includes one or more flea 
CE proteins having acidic to neutral isoelectric points, or pi values. An isoelectric pH, 
or pi, value refers to the pH value at which a molecule has no net electric charge and 
fails to move in an electric field. A preferred formulation of the present invention 
includes one or more proteins having a pi value ranging from about pi 2 to about 10, 
more preferably from about pi 3 to about 8, and even more preferably from about pi 4.7 
to about 5.2, as determined by IEF-PAGE. 

An esterase formulation, including a CE formulation, of the present invention can 
be prepared by a method that includes the steps of: (a) preparing an extract by isolating 
flea tissue, homogenizing the tissue by sonication and clarifying the extract by 
centrifugation at a low speed spin, e.g., about 18,000 rpm for about 30 minutes; (b) 
recovering soluble proteins from said centrifuged extract and applying the proteins to a 
p-aminobenzamidine agarose bead column; (c) recovering unbound protein from the 
column and clarifying by filtration; (d) applying the clarified protein to a gel filtration 
column and eluting and collecting fractions with esterase activity; (e) dilayzing the 
eluate against 20 mM MES buffer, pH 6.0, containing 10 mM NaCl; (f) applying the 
dialysate to a cation exchange chromatography column, eluting protein bound to the 
5 column with a linear grader t of from about 10 mM NaCl to about 1 M NaCl in 20 mM 
MES buffer, pH 6, and collecting fractions having esterase activity; (g) adjusting the pH 
of the resulting fractions to pH 7 and applying the fractions to an anion exchange 
chromatography column; (h) eluting protein bound to the column with a linear gradient 
of from about 0 to about 1 M NaCl in 25 mM Tris buffer, pH 6.8 and collecting fractions 
0 having esterase activity, such activity elutes from the column at about 1 70 mM NaCl. 
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Tissue can be obtained from unfed fleas or from fleas that recently consumed a 
blood meal (i.e., blood-fed fleas). Such flea tissues are referred to herein as, 
respectively, unfed flea and fed flea tissue. Preferred flea tissue from which to obtain an 
esterase formulation of the present invention includes pre-pupal larval tissue, wandering 
flea larvae, 3rd instar tissue, fed adult tissue and unfed adult tissue. 

In a preferred embodiment, a CE formulation of the present invention comprises 
a flea protein comprising amino acid sequence SEQ ID NO:5, SEQ ID NO:19, SEQ ID 
NO:39, SEQ ID NO.40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID 

NO:44 and/or SEQ ID NO:53. 

Another embodiment of the present invention is a juvenile hormone esterase 
(JHE) formulation comprising one or more arthropod JHE proteins, the arthropod being 
of the order Hemiptera, Anoplura, Mallophaga, Diptera, Siphonaptera, Parasitiformes, 
Acariformes and Acarina. The present invention includes the discovery that such a 
formulation has JHE activity. JHE activity can be identified using methods known to 
those of skill in the art and described in the Examples section herein. A suitable 
formulation of the present invention comprises one or more arthropod proteins capable 
of hydrolyzing a methyl ester group of juvenile hormone to produce a juvenile hormone 
acid. Preferably, such a protein is capable of releasing of at least about 120 counts per 
minute when such a protein is incubated in the presence of >H-juvenile hormone to 
create a reaction mixture, the reaction mixture is combined with isooctane, the aqueous 
phase is recovered and the amount of 'H-juvenile hormone present in that phase is 
determined. Such a protein is also preferably capable of causing release of methane thiol 
when such protein is incubated in the presence of methyl 1-heptylthioacetothioate 
(HEPTAT) using the method generally disclosed in McCutchen et al., Insect Biochem. 
75 Molec. Biol, Vol. 25, No. 1, pg 1 19-126, 1995, which is incorporated in its entirety by 
this reference. 

In one embodiment, a juvenile hormone esterase formulation of the present 
invention comprises a protein comprising amino acid sequence SEQ ID NO:74. 
According to the present invention, an arthropod that is not of the order 
30 lepidoptera includes an arthropod of the order Hemiptera, Anoplura, Mallophaga, 
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Diptera, Siphonaptera, Parasitiformes, Acariformes and Acarina, Preferred arthropods 
include Hemiptera cimicidae, Hemiptera reduviidae, Anoplura pediculidae, Anoplura 
pthiridae, Diptera culicidae, Diptera simuliidae, Diptera psychodidae, Diptera 
ceratopogonidae, Diptera chaoboridae, Diptera tabanidae, Diptera rhagionidae, 
athericidae, Diptera chloropidae, Diptera muscidae, Diptera hippoboscidae, Diptera 
calliphoridae, Diptera sarcophagidae, Diptera oestridae, Diptera gastrophilidae, Diptera 
cuterebridae, Siphonaptera ceratophyllidae, Siphonaptera leptopsyllidae, Siphonaptera 
pulicidae, Siphonaptera tungidae, Parasitiformes dermanyssidae, Acariformes 
tetranychidae, Acariformes cheyletide, Acariformes demodicidae, Acariformes 
erythraeidae, Acariformes trombiculidae, Acariformes psoroptidae, Acariformes 
sarcoptidae, Acarina argasidae and Acarina ixodidae. Preferred Diptera muscidae 
include Musca, Hydrotaea, Stomoxys Haematobia. Preferred Siphonaptera include 
Ceratophyllidae nosopsyllus. Ceratophyllidae diamanus, Ceratophyllidae ceratophyllus, 
Leptopsyllidae leptopsylla, Pulicidae pulex, Pulicidae ctenocephalides, Pulicidae 
15 xen opsylla, Pulicidae echidnophaga and Tungidae tunga. Preferred Parasitiformes 
dermanyssidae include Ornithonyssus and Liponyssoides. Preferred Acarina include 
Argasidae argas, Argasidae ornithodoros, Argasidae otobius, Ixodidae Ixodes. Ixodidae 
hyalomma, Ixodidae nosomma, Ixodidae rhipicephalus, Ixodidae boophilus, Ixodidae 
dermacentor, Ixodidae haemaphysalus. Ixodidae amblyomma and Ixodidae anocentor. 
20 One embodiment of a JHE formulation of the present invention is one or more 

arthropod JHE proteins that range in molecular weight from about 20 kD to about 200 
kD, more preferably from about 40 kD to about 100 kD, and even more preferably from 
about 60 kD to about 75 kD, as determined by SDS-PAGE (sodium dodecyl sulfate 
polyacrylamide gel electrophoresis). 
25 A JHE formulate of the present invention can be prepared by a method diat 

includes the steps of: (a) preparing soluble proteins from arthropod extracts as described 
above for CE purification and purifying such soluble proteins by gel filtration; (b) 
collecting fractions having JHE activity from the gel filtration step, loading the fractions 
onto a cation exchange column, eluting the cation exchange column with a linear 
30 gradient of from about 10 mM NaCl to about 1 M NaCl in 20 «M MES buffer, P H 6 and 
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collecting fractions having JHE activity; (c) adjusting the pH of the collected fractions to 
about pH 7 are dialyzed against about 10 mM phosphate buffer (pH 7.2) containing 
about 10 mM NaCl; (d) applying the dialysate to a hydroxyapatite column, eluting 
protein bound to the column with a linear gradient of from about 10 mM phosphate 
5 buffer (pH 7.2) containing 10 mM NaCl to about 0.5 M phosphate buffer (pH 6.5) 

containing 10 mM NaCl and collecting fractions having JHE activity; (e) dialyzing the 
fractions against 20 mM Tris buffer (pH 8.0) containing 10 mM NaCl; (f) applying the 
dialysate an anion exchange chromatography column and eluting protein bound to the 
column with a linear gradient of from about 10 mM to about 1 M NaCl in 20 mM Tris 
1 0 buffer, pH 8 and collecting fractions containing JHE activity. 

A JHE formulation of the present invention can be prepared by a method that 
includes the steps of: (a) preparing flea extracts as described herein in the Examples 
section and applying the extract to p-aminobenzamidine linked agarose beads and 
collecting protein not bound to the beads; (b) applying the unbound protein to a 
15 Superdex 200 HR gel filtration column and collecting fractions having JHE activity; (c) 
applying the fractions to an anion exchange chromatography column, eluting the anion 
exchange column with a linear gradient of 0 to 1 M NaCl in 25 mM Tris buffer, pH 6.S 
and collecting fractions having JHE activity; (d) dialyzing the fractions overnight against 
about 1 L of 20 mM Tris buffer, pH 8.0, containing 10 mM NaCl; (c) applying the 
20 dialysate to a Poros 10 HQ anion exchange column, eluting the column with buffer 
containing about 120 mM NaCl and collecting fractions having JHE activity. 

Suitable arthropods from which to isolate a JHE formulation of the present 
invention include, but are not limited to agricultural pests, stored product pests, forest 
pests, structural pests or animal health pests. Suitable agricultural pests of the present 
2' invention include , but are not limited to Color.de potato beetle-, corn earworms, 
fleahoppers, weevils, pink boll worms, cotton aphids, beet armyworms, lygus bugs, 
hessian flies, sod webworms, whites grubs, diamond back moths, white flies, 
planthoppers, leafhoppers, mealy bugs, mormon crickets and mole crickets. Suitable 
stored product pests of the present invention include, but are not limited to demiestids, 
30 anobeids, saw toothed grain beetles, indian mealmoths, flour beetles, long-horn wood 
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boring beetles and metallic wood boring beetles. Suitable forest pests of the present 
invention include , but are not limited to southern pine bark bettles, gypsy moths, elm 
beetles, ambrosia bettles, bag worms, tent worms and tussock moths. Suitable structural 
pests of the present invention include, but are not limited to, bess beetles, termites, fire 
ants, carpenter ants, wasps, hornets, cockroaches, silverfish, Musca domestica and 
Musca autumnalis. Suitable animal health pests of the present invention include, but are 
not limited to fleas, ticks, mosquitoes, black flies, lice, true bugs, sand flies, 
Psychodidae, tsetse flies, sheep blow flies, cattle grub, mites, horn flies, heel flies, deer 
flies, Culicoides and warble flies. Preferred arthropods from which to isolate a JHE 
formulation of the present invention include fleas, midges, mosquitos, sand flies, black 
flies, horse flies, snipe flies, louse flies, horn flies, deer flies, tsetse flies, buffalo flies, 
blow flies, stable flies, myiasis-causing flies, biting gnats, lice, mites, bee, wasps, ants, 
true bugs and ticks, preferably fleas, ticks and blow flies, and more preferably fleas. 
Preferred fleas from which to isolate JHE proteins include Ctenocephalides, 
i Ceratophyllus, Diamanus, Echidnophaga, Nosopsyllus, Pulex, Tunga, Oropsylla, 
Orchopeus and Xenopsylla. More preferred fleas include Ctenocephalides felis, 
Ctenocephalides canis, Ceratophyllus pulicidae, Pulex irritans, Oropsylla (Thrassis) 
bacchi, Oropsylla (Diamanus) montana, Orchopeus howrdi, Xenopsylla cheopis and 
Pulex simulans, with C. felis being even more preferred. 
0 Suitable tissue from which to isolate a JHE formulation of the present invention 

includes unfed fleas or fleas that recently consumed a blood meal (i.e., blood-fed fleas). 
Such flea tissues are referred to herein as, respectively, unfed flea and fed flea tissue. 
Preferred flea tissue from which to obtain a JHE formulation of the present invention 
includes pre-pupal larval tissue, 3rd instar tissue, fed or unfed adult tissue, with unfed 
15 adult gut tissue bei .ig more preferred than fed or unfed whole adult tissue. It is of note 
that a JHE formulation of the present invention obtained from pre-pupal larval tissue 
does not hydrolyze a-napthyl acetate. 

Another embodiment of the present invention is an esterase formulation 
comprising a combination of one or more arthropod CE and JHE proteins of the present 
30 invention. Suitable arthropods from which to isolate a combined CE and JHE 
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formulation include those arthropods described herein for the isolation of a JHE 
formulation of the present invention. Preferred arthropods from which to isolate a 
combined CE and JHE formulation include fleas, midges, mosquitos, sand flies, black 
flies, horse flies, horn flies, deer flies, tsetse flies, buffalo flies, blow flies, stable flies, 
myiasis-causing flies, biting gnats, lice, bee, wasps, ants, true bugs and ticks, preferably 
fleas, ticks and blow flies, and more preferably fleas. Suitable flea tissue from which to 
isolate a combined CE and JHE formulation of the present invention includes 3rd instar 
tissue, fed or unfed adult tissue and unfed adult tissue, with unfed adult gut tissue being 
more preferred than fed or unfed whole adult tissue. 

In one embodiment, a formulation of the present invention comprises an esterase 
having both CE and JHE activity. Preferably, a formulation of the present invention 
that comprises an esterase having both CE and JHE activity comprises a flea protein 
comprising amino acid sequence SEQ ID NO:8 and/or SEQ ID NO:37. 

Another embodiment of the present invention is an isolated protein comprising 
an arthropod esterase protein. It is to be noted that the term "a" or "an" entity refers to 
one or more of that entity; for example, a protein refers to one or more proteins or at 
least one protein. As such, the terms "a" (or "an"), "one or more" and "at least one" can 
be used interchangeably herein. It is also to be noted that the terms "comprising", 
"including", and "having" can be used interchangeably. Furthermore, a compound 
"selected from the group consisting of refers to one or more of the compounds in the 
list that follows, including mixtures (i.e., combinations) of two or more of the 
compounds. According to the present invention, an isolated, or biologically pure, 
protein, is a protein that has been removed from its natural milieu. As such, "isolated" 
and "biologically pure" do not necessarily reflect the extent to which the protein has 
> been purified. An isolated protein of the preset .nvention can be obtained from its 

natural source, can be produced using recombinant DNA technology or can be produced 

by chemical synthesis. 

As used herein, an isolated arthropod esterase protein can be a full-length protein 
or any homolog of such a protein. An isolated protein of the present invention, 
0 including a homolog, can be identified in a straight- forward manner by the protein's 
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ability to elicit an immune response against arthropod esterase proteins, to hydrolyze a- 
napthyl acetate, to hydrolyze the methyl ester group of juvenile hormone or bind to DFP. 
Esterase proteins of the present invention include CE and JHE proteins. As such, an 
esterase protein of the present invention can comprise a protein capable of hydrolyzing 
5 a-napthyl acetate, hydrolyzing the methyl ester group of juvenile hormone and/or 
binding to DFP. Examples of esterase homologs include esterase proteins in which 
amino acids have been deleted (e.g., a truncated version of the protein, such as a 
peptide), inserted, inverted, substituted and/or derivatized (e.g., by glycosylate, 
phosphorylation, acetylation, myristoylation, prenylation, palmitoylation, amidation 
0 and/or addition of glycerophosphatidyl inositol) such that the homolog includes at least 
one epitope capable of eliciting an immune response against an arthropod esterase 
protein. That is, when the homolog is administered to an animal as an immunogen, 
using techniques known to those skilled in the art, the animal will produce an immune 
response against at least one epitope of a natural arthropod esterase protein. The ability 
15 of a protein to effect an immune response, can be measured using techniques known to 
those skilled in the art. Esterase protein homologs of the present invention also include 
esterase proteins that hydrolyze a-napthyl acetate and/or that hydrolyze the methyl ester 

group of juvenile hormone. 

Arthropod esterase protein homologs can be the result of natural allelic variation 
20 or natural mutation. Esterase protein homologs of the present invention can also be 
produced using techniques known in the art including, but not limited to, direct 
modifications to the protein or modifications to the gene encoding the protein using, for 
example, classic or recombinant nucleic acid techniques to effect random or targeted 
mutagenesis. 

25 Isolateu e .terase proteins of the present invention have the furth ,r characteristic 

of being encoded by nucleic acid molecules that hybridize under stringent hybridization 
conditions to a gene encoding a Ctenocephalides felis protein (i.e., a C.felis esterase 
gene). As used herein, stringent hybridization conditions refer to standard hybridization 
conditions under which nucleic acid molecules, including oligonucleotides, are used to 
30 identify similar nucleic acid molecules. Such standard conditions are disclosed, for 
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example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Labs Press, 1989; Sambrook et al., ibid., is incorporated by reference herein in its 
entirety. Stringent hybridization conditions typically permit isolation of nucleic acid 
molecules having at least about 70% nucleic acid sequence identity with the nucleic acid 
5 molecule being used to probe in the hybridization reaction. Formulae to calculate the 
appropriate hybridization and wash conditions to achieve hybridization permitting 30% 
or less mismatch of nucleotides are disclosed, for example, in Meinkoth et al., 1984, 
Anal. Biochem. 138, 267-284; Meinkoth et al., ibid., is incorporated by reference herein 
in its entirety. 

10 As used herein, a C. felis esterase gene includes all nucleic acid sequences related 

to a natural C. felis esterase gene such as regulatory regions that control production of 
the C. felis esterase protein encoded by that gene (such as, but not limited to, 
transcription, translation or post-translation control regions) as well as the coding region 
itself. In one embodiment, a C. felis esterase gene of the present invention includes the 
15 nucleic acid sequence SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ 
ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
20 NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:57, SEQ ID NO:59, SEQ ID NO:60, SEQ ID 
NO:61, SEQ ID NO:67, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID 
NO:72, SEQ ID NO:76 and/or a nucleic acid molecule encoding a protein comprising 
amino acid sequence SEQ ID NO:74. Nucleic acid sequence SEQ ID NO:l represents 
25 the deduced sequence of the coding sTavd of a PCR amplified nucleic acid molecule 
denoted herein as nfEl 4 o„ the production of which is disclosed in the Examples. The 
complement of SEQ ID NO:l (represented herein by SEQ ID NO:3) refers to the nucleic 
acid sequence of the strand complementary to the strand having SEQ ID NO:l , which 
can easily be determined by those skilled in the art. Likewise, a nucleic acid sequence 
30 complement of any nucleic acid sequence of the present invention refers to the nucleic 
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acid sequence of the nucleic acid strand that is complementary to (i.e., can form a 
complete double helix with) the strand for which the sequence is cited. 

Nucleic acid sequence SEQ ID NO:4 represents the deduced sequence of the 
coding strand of a PCR amplified nucleic acid molecule denoted herein as nfE2 364 , the 
5 production of which is disclosed in the Examples. The complement of SEQ ID NO:4 is 
represented herein by SEQ ID NO:6. 

Nucleic acid sequence SEQ ID NO:7 represents the deduced sequence of the 
coding strand of a PCR amplified nucleic acid molecule denoted herein as nfE3 421 , the 
production of which is disclosed in the Examples. The complement of SEQ ID NO:7 is 
10 represented herein by SEQ ID NO:9. 

Nucleic acid sequence SEQ ID NO: 10 represents the deduced sequence of the 
coding strand of a PCR amplified nucleic acid molecule denoted herein as nfE4 524 , the 
production of which is disclosed in the Examples. The complement of SEQ ID NO: 10 is 
represented herein by SEQ ID NO: 12. 
15 Nucleic acid sequence SEQ ID NO: 1 3 represents the deduced sequence of the 

coding strand of an apparent coding region of a complementary DNA (cDNA) nucleic 
acid molecule denoted herein as nfE5 1982 , the production of which is disclosed in the 
Examples. The complement of SEQ ID NO:l 3 is represented herein by SEQ ID NO: 1 5. 
Nucleic acid sequence SEQ ID NO: 18 represents the deduced sequence of the 
20 coding strand of an apparent coding region of a cDNA nucleic acid molecule denoted 
herein as nfE6 1792 , the production of which is disclosed in the Examples. The 
complement of SEQ ID NO: 18 is represented herein by SEQ ID NO:20. 

Nucleic acid sequence SEQ ID NO:24 represents the deduced sequence of the 
coding strand of an apparent coding region of a cDNA nucleic acid molecule denoted 
25 herein as • ifE7 2g36 , the production of which is disclosed in the Exam F ks. The 
complement of SEQ ID NO:24 is represented herein by SEQ ID NO:26. 

Nucleic acid sequence SEQ ID NO:30 represents the deduced sequence of the 
coding strand of an apparent coding region of a cDNA nucleic acid molecule denoted 
herein as nfE8 280 „ the production of which is disclosed in the Examples. The 
30 complement of SEQ ID NO:30 is represented herein by SEQ ID NO:32. 
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Nucleic acid sequence SEQ ID NO:36 represents the deduced sequence of the 
coding strand of an apparent coding region of a cDNA nucleic acid molecule denoted 
herein as nfE9 200 7, the production of which is disclosed in the Examples. The 
complement of SEQ ID NO:36 is represented herein by SEQ ID NO:38. 
5 Nucleic acid sequence SEQ ID NO:57 represents the deduced sequence of the 

coding strand of an apparent coding region of a cDNA nucleic acid molecule denoted 
herein as nfE5 2144) the production of which is disclosed in the Examples. The 
complement of SEQ ID NO:57 is represented herein by SEQ ID NO:59. 

Nucleic acid sequence SEQ ID NO:67 represents the deduced sequence of the 
1 0 coding strand of an apparent coding region of a cDNA nucleic acid molecule denoted 
herein as nfE10 1987 , the production of which is disclosed in the Examples. The 
complement of SEQ ID NO:67 is represented herein by SEQ ID NO:69. 

It should be noted that since nucleic acid sequencing technology is not entirely 
error-free, the nucleic acid sequences and amino acid sequences presented herein 
15 represent, respectively, apparent nucleic acid sequences of nucleic acid molecules of the 
present invention and apparent amino acid sequences of esterase proteins of the present 
invention. 

In another embodiment, a C.felis esterase gene can be an allelic variant that 
includes a similar but not identical sequence to SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
20 NO*. SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ 
ID NO.20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID 
NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID 
25 NO:38, SEQ ID NO:51, SEQ ID NO:5/, SEQ ID NO:57, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61, SEQ ID NO:67, SEQ ID NO:69, SEQ ID NO:70, SEQ ID 
NO:71, SEQ ID NO:72, SEQ ID NO:76 and/or a nucleic acid molecule encoding a 
protein' comprising amino acid sequence SEQ ID NO:74. An allelic variant of a C.felis 
esterase gene is a gene that occurs at essentially the same locus (or loci) in the genome 
30 as the gene including SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ 
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ID NO:7, SEQ ID N0:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ ID 
NO:15, SEQ ID N0:16, SEQ ID NO:17, SEQ ID N0:18, SEQ ID NO:20, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:27', SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
5 NO:33', SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
NO:5l', SEQ ID NO:52, SEQ ID NO:57, SEQ ID NO:59, SEQ ID NO.60, SEQ ID 
NO:6l] SEQ ID NO:67, SEQ ID NO:69, SEQ ID NO.70, SEQ ID N0:71, SEQ ID 
NO:72, SEQ ID NO:76 and/or a nucleic acid molecule encoding a protein comprising 
amino acid sequence SEQ ID NO:74, but which, due to natural variations caused by, for 
1 0 example, mutation or recombination, has a similar but not identical sequence. Allelic 
variants typically encode proteins having similar activity to that of the protein encoded 
by the gene to which they are being compared. Allelic variants can also comprise 
alterations in the 5' or 3' untranslated regions of the gene (e.g., in regulatory control 
regions). Allelic variants are well known to those skilled in the art and would be 
15 expected to be found within a given arthropod since the genome is diploid and/or among 

a group of two or more arthropods. 

The minimal size of an esterase protein homolog of the present invention is a size 
sufficient to be encoded by a nucleic acid molecule capable of forming a stable hybrid 
(i.e., hybridize under stringent hybridization conditions) with the complementary 
20 sequence of a nucleic acid molecule encoding the corresponding natural protein. As 
such, the size of the nucleic acid molecule encoding such a protein homolog is 
dependent on nucleic acid composition and percent homology between the nucleic acid 
molecule and complementary sequence. It should also be noted that the extent of 
homology required to form a stable hybrid can vary depending on whether the 
25 homo;.f. ,us sequences are interspersed throughout the nucleic acid molecules or are 

clustered (i.e., localized) in distinct regions on the nucleic acid molecules. The minimal 
size of such nucleic acid molecules is typically at least about 12 to about 15 nucleotides 
in length if the nucleic acid molecules are GC-rich and at least about 15 to about 17 
bases in length if they are AT-rich. As such, the minimal size of a nucleic acid molecule 
30 used to encode an esterase protein homolog of the present invention is from about 12 to 
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about 18 nucleotides in length. Thus, the minimal size of an esterase protein homolog of 
the present invention is from about 4 to about 6 amino acids in length. There is no limit, 
other than a practical limit, on the maximal size of such a nucleic acid molecule in that 
the nucleic acid molecule can include a portion of a gene, an entire gene, multiple genes, 
5 or portions thereof. The preferred size of a protein encoded by a nucleic acid molecule 
of the present invention depends on whether a full-length, fusion, multivalent, or 
functional portion of such a protein is desired. 

One embodiment of the present invention includes an arthropod esterase protein 
having CE enzyme activity. Such a CE protein preferably includes: a catalytic triad of 
1 0 serine - histidine - glutamic acid as well as the essential amino acids arginine and 
aspartic acid at positions similar to those described for juvenile hormone esterase, for 
example by Ward et al., 1992, Int JBiochem 24: 1933-1941; this reference is 
incorporated by reference herein in its entirety. Analysis of the apparent full-length 
protein sequences disclosed herein indicates that each of these amino acid sequences 
15 includes these amino acid motifs, as well as surrounding consensus sequences. 

Suitable arthropods from which to isolate esterase proteins having general CE 
activity of the present invention (including isolation of the natural protein or production 
of the protein by recombinant or synthetic techniques) preferably include insects and 
acarines but not Culicidae, Drosophilidae, Calliphoridae, Sphingidae, Lymantriidae, 
20 Noctuidae, Fulgoroidae and Aphididae. Preferred arthropods from which to isolate CE 
proteins having general CE activity include fleas, ticks, black flies, lice, true bugs, sand 
flies, Psychodidae, tsetse flies, cattle grub, mites, horn flies, heel flies, deer flies, 
Culicoides and warble flies. Preferred arthropods from which to isolate an esterase 
proteins having general CE activity include fleas, midges,, sand flies, black flies, horse 
25 flies, snipe flies, louse flies, ho-n flies, deer flies, tsetse flies, buffalo flies, blow flies, 
stable flies, myiasis-causing flies, biting gnats, lice, mites, bee, wasps, ants, true bugs 
and ticks, preferably fleas, ticks and blow flies, and more preferably fleas. Preferred 
fleas from which to isolate esterase proteins having general CE activity include 
Ctenocephalides, Ceratophyllus, Diamanus, Echidnophaga, Nosopsyllus, Pulex, Tunga, 
30 Oropsylla, Orchopeus and Xenopsylla. More preferred fleas include Ctenocephalides 
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felis, Ctenocephalides canis, Ceratophyllus pulicidae, Pulex irritans, Oropsylla 
(Thrassis) bacchi, Oropsylla (Diamanus) montana, Orchopeus howardi, Xenopsylla 
cheopis and Pulex simulans, with C.felis being even more preferred. 

A preferred arthropod esterase protein of the present invention is a compound 
that when administered to an animal in an effective manner, is capable of protecting that 
animal from hematophagous ectoparasite infestation. In accordance with the present 
invention, the ability of an esterase protein of the present invention to protect an animal 
from hematophagous ectoparasite infestation refers to the ability of that protein to, for 
example, treat, ameliorate and/or prevent infestatipn caused by hematophagous 
arthropods. In particular, the phrase "to protect an animal from hematophagous 
ectoparasite infestation" refers to reducing the potential for hematophagous ectoparasite 
population expansion on and around the animal (i.e., reducing the hematophagous 
ectoparasite burden). Preferably, the hematophagous ectoparasite population size is 
decreased, optimally to an extent that the animal is no longer bothered by 
hematophagous ectoparasites. A host animal, as used herein, is an animal from which 
hematophagous ectoparasites can feed by attaching to and feeding through the skin of 
the animal. Hematophagous ectoparasites, and other ectoparasites, can live on a host 
animal for an extended period of time or can attach temporarily to an animal in order to 
feed. At any given time, a certain percentage of a hematophagous ectoparasite * 
population can be on a host animal whereas the remainder can be in the environment of 
the animal. Such an environment can include not only adult hematophagous 
ectoparasites, but also hematophagous ectoparasite eggs and/or hematophagous 
ectoparasite larvae. The environment can be of any size such that hematophagous 
ectoparasites in the environment are able to jump onto and off of a host animal. For 
example, the environment of an animal can include plants, such as crops, from which 
hematophagous ectoparasites infest an animal. As such, it is desirable not only to reduce 
the hematophagous ectoparasite burden on an animal per se, but also to reduce the 
hematophagous ectoparasite burden in the environment of the animal. In one 
embodiment, an esterase protein of the present invention can elicit an immune response 
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(including a humoral and/or cellular immune response) against a hematophagous 
ectoparasite. 

Suitable hematophagous ectoparasites to target include any hematophagous 
ectoparasite that is essentially incapable of infesting an animal administered an esterase 

5 protein of the present invention. As such, a hematophagous ectoparasite to target 
includes any hematophagous ectoparasite that produces a protein having one or more 
epitopes that can be targeted by a humoral and/or cellular immune response against an 
esterase protein of the present invention and/or that can be targeted by a compound that 
otherwise inhibits esterase activity (e.g., a compound that inhibits hydrolysis of a- 

10 napthyl acetate, hydrolysis of the methyl ester group of juvenile hormone, and/or binds 
to DFP), thereby resulting in the decreased ability of the hematophagous ectoparasite to 
infest an animal. Preferred hematophagous ectoparasite to target include ectoparasites 
disclosed herein as being useful in the production of esterase proteins of the present 
invention. 

15 The present invention also includes mimetopes of esterase proteins of the present 

invention. As used herein, a mimetope of an esterase protein of the present invention 
refers to any compound that is able to mimic, the activity of such a protein (e.g., ability to 
elicit an immune response against an arthropod esterase protein of the present invention 
and/or ability to inhibit esterase activity), often because the mimetope has a structure 

20 that mimics the esterase protein. It is to be noted, however, that the mimetope need not 
have a structure similar to an esterase protein as long as the mimetope functionally 
mimics the protein. Mimetopes can be, but are not limited to: peptides that have been 
modified to decrease their susceptibility to degradation; anti-idiotypic and/or catalytic 
antibodies, or fragments thereof; non-proteinaceous immunogenic portions of an isolated 

25 protein (e.g., carbohydrate s^u ,tures); synthetic or natural organic or inorganic 
molecules, including nucleic acids; and/or any other peptidomimetic compounds. 
Mimetopes of the present invention can be designed using computer-generated structures 
of esterase proteins of the present invention. Mimetopes can also be obtained by 
generating random samples of molecules, such as oligonucleotides, peptides or other 
30 organic molecules, and screening such samples by affinity chromatography techniques 
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using the corresponding binding partner, (e.g., an esterase substrate, an esterase substrate 
analog, or an anti-esterase antibody). A preferred mimetope is a peptidomimetic 
compound that is structurally and/or functionally similar to an esterase protein of the 
present invention, particularly to the active site of the esterase protein. 

5 The present invention also includes mimetopes of esterase proteins of the present 

invention. As used herein, a mimetope of an esterase protein of the present invention 
refers to any compound that is able to mimic the activity of such an esterase protein, 
often because the mimetope has a structure that mimics the esterase protein. Mimetopes 
can be, but are not limited to: peptides that have been modified to decrease their 

1 0 susceptibility to degradation; anti-idiotypic and/or catalytic antibodies, or fragments 
thereof; non-proteinaceous immunogenic portions of an isolated protein (e.g., 
carbohydrate structures); and synthetic or natural organic molecules, including nucleic 
acids. Such mimetopes can be designed using computer-generated structures of proteins 
of the present invention. Mimetopes can also be obtained by generating random samples 

15 of molecules, such as oligonucleotides, peptides or other organic molecules, and 

screening such samples by affinity chromatography techniques using the corresponding 
binding partner. 

One embodiment of an arthropod esterase protein of the present invention is a 
fusion protein that includes an arthropod esterase protein-containing domain attached to 
20 one or more fusion segments. Suitable fusion segments for use with the present 

invention include, but are not limited to, segments that can: enhance a protein's stability; 
act as an immunopotentiator to enhance an immune response against an esterase protein; 
and/or assist purification of an esterase protein (e.g., by affinity chromatography). A 
suitable fusion segment can be a domain of any size that has the desired function (e.g., 
25 imparts increased stability, imparts increased immune genicity to a protein, and/or 
simplifies purification of a protein). Fusion segments can be joined to amino and/or 
carboxyl termini of the esterase-containing domain of the protein and can be susceptible 
to cleavage in order to enable straight-forward recovery of an esterase protein. Fusion 
proteins are preferably produced by culturing a recombinant cell transformed with a 
30 fusion nucleic acid molecule that encodes a protein including the fusion segment 
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attached to either the carboxyl and/or amino terminal end of an esterase-containing 
domain. Preferred fusion segments include a metal binding domain (e.g., a poly- 
histidine segment); an immunoglobulin binding domain (e.g., Protein A; Protein G; T 
cell; B cell; Fc receptor or complement protein antibody-binding domains); a sugar 
5 binding domain (e.g., a maltose binding domain); and/or a "tag" domain (e.g., at least a 
portion of P-galactosidase, a strep tag peptide, other domains that can be purified using 
compounds that bind to the domain, such as monoclonal antibodies). More preferred 
fusion segments include metal binding domains, such as a poly-histidine segment; a 
maltose binding domain; a strep tag peptide, such as that available from Biometra in 
10 Tampa, FL; and an S10 peptide. Examples of particularly preferred fusion proteins of 
the present invention include PHIS-PfE6 540 , PHIS-PfE7 275 , PHIS-PfB7 570 , PHIS-PfES 570 
and PHIS-PfE9 52g , production of which are disclosed herein. 

In another embodiment, an arthropod esterase protein of the present invention 
also includes at least one additional protein segment that is capable of protecting an 
1 5 animal from hematophagous ectoparasite infestations. Such a multivalent protective 
protein can be produced by culturing a cell transformed with a nucleic acid molecule 
comprising two or more nucleic acid domains joined together in such a manner that the 
resulting nucleic acid molecule is expressed as a multivalent protective compound 
containing at least two protective compounds, or portions thereof, capable of protecting 
20 an animal from hematophagous ectoparasite infestation by, for example, targeting two 

different arthropod proteins. 

Examples of multivalent protective compounds include, but are not limited to, an 
esterase protein of the present invention attached to one or more compounds protective 
against one or more arthropod compounds. Preferred second compounds are 

25 proteinaceous compoui .ds that effect active immunization (e.g., antigen vaccines;, 

passive immunization (e.g., antibodies), or that otherwise inhibit a arthropod activity that 
when inhibited can reduce hematophagous ectoparasite burden on and around an animal. 
Examples of second compounds include a compound that inhibits binding between an 
arthropod protein and its ligand (e.g., a compound that inhibits flea ATPase activity or a 

30 compound that inhibits binding of a peptide or steroid hormone to its receptor), a 



-25- 



FC-1-C1-PCT 



compound that inhibits hormone (including peptide or steroid hormone) synthesis, a 
compound that inhibits vitellogenesis (including production of vitellin and/or transport 
and maturation thereof into a major egg yolk protein), a compound that inhibits fat body 
function, a compound that inhibits muscle action, a compound that inhibits the nervous 
5 system, a compound that inhibits the immune system and/or a compound that inhibits 
hematophagous ectoparasite feeding. Examples of second compounds also include 
proteins obtained from different stages of hematophagous ectoparasite development. 
Particular examples of second compounds include, but are not limited to, serine 
proteases, cysteine proteases, aminopeptidases, serine protease inhibitor proteins, 
10 calreticulins, larval serum proteins and echdysone receptors, as well as antibodies to and 
inhibitors of such proteins. In one embodiment, an arthropod esterase protein of the 
present invention is attached to one or more additional compounds protective against 
hematophagous ectoparasite infestation. In another embodiment, one or more protective 
compounds, such as those listed above, can be included in a multivalent vaccine 
15 comprising an arthropod esterase protein of the present invention and one or more other 
protective molecules as separate compounds. 

A preferred isolated protein of the present invention is a protein encoded by a 
nucleic acid molecule that hybridizes under stringent hybridization conditions with 
nucleic acid molecules nfEl 40 i> nfE2 3W , nfE3 421) nfE4 524 , nfE5 l9S2 , nfE5 1515 , nf£5 2144 , 
20 nfE5 I650 , nfE6 14 88, nfE6 l792 , nfE6 1650 , nfE7 2S36 , nfE7 l788 , nfE7 m0 , nfE7 050 , nfES 2SOl , 
nfE8 1785 , nfE8 1710 , nfE9 2007 , nf£9 1584 , nfE9 1540 , nfE10 1987 and/or nfE10 1590 . A further 
preferred isolated protein is encoded by a nucleic acid molecule that hybridizes under 
stringent hybridization conditions with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:12, SEQ ID NO:15, 
25 SEQ T D NO:17, SEQ ID NO:20, SEQ ID NO:22, JLQ ID NO:26, SEQ ID NO:29, SEQ 
ID NO:32, SEQ ID NO:35, SEQ ID NO:38, SEQ ID NO:52, SEQ ID NO:59, SEQ ID 
NO:61, SEQ ID NO:69 and/or SEQ ID NO:71. 

Translation of SEQ ID NO:l suggests that nucleic acid molecule nfEl 401 encodes 
a non-full-length arthropod esterase protein of about 103 amino acids, referred to herein 
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as PfEl 103> represented by SEQ ID NO:2, assuming the first codon spans from nucleotide 
92 through nucleotide 94 of SEQ ID NO:l. 

Comparison of amino acid sequence SEQ ID NO:2 (i.e., the amino acid sequence 
of PfEl 103 ) with amino acid sequences reported in GenBank indicates that SEQ ID NO:2, 
showed the most homology, i.e., about 33% identity, between SEQ ID NO:2 and alpha 
esterase protein from Drosophila melanogaster. . 

Translation of SEQ ID NO:4 suggests that nucleic acid molecule nfE2 364 encodes 
a non-full-length arthropod esterase protein of about 121 amino acids, referred to herein 
as PfE2 12 „ represented by SEQ ID NO:5, assuming the first codon spans from nucleotide 
2 through nucleotide 4 of SEQ ID NO:4. 

Comparison of amino acid sequence SEQ ID NO:5 (i.e., the amino acid sequence 
of PfE2 121 ) with amino acid sequences reported in GenBank indicates that SEQ ID NO:5, 
showed the most homology, i.e., about 38% identity, between SEQ ID NO:5 and alpha 
esterase protein from Drosophila melanogaster. 

Translation of SEQ ID NO:7 suggests that nucleic acid molecule nfE3 421 encodes 
a non-full-length arthropod esterase protein of about 103 amino acids, referred to herein 
as PfE3 l03 , represented by SEQ ID NO:8, assuming the first codon spans from nucleotide 
1 13 through nucleotide 1 15 of SEQ ID NO:7. 

Comparison of amino acid sequence SEQ ID NO:8 (i.e., the amino acid sequence 
of PfE3 10 3) with amino acid sequences reported in GenBank indicates that SEQ ID NO:S, 
showed the most homology, i.e., about 39% identity, between SEQ ID NO:8 and alpha 
esterase protein from Drosophila melanogaster. 

Translation of SEQ ID NO:10 suggests that nucleic acid molecule nfE4 524 
encodes a non-full-length arthropod esterase protein of about 137 amino acids, referred 
> to herein as PfE4 lJ7 , represented by SEQ ID NO:ll, assuming the first rod m spans from 
nucleotide 113 through nucleotide 115 of SEQ ID NO: 10. 

Comparison of amino acid sequence SEQ ID NO:l 1 (i.e., the amino acid 
sequence of PfE4 137 ) with amino acid sequences reported in GenBank indicates that SEQ 
ID NO: 1 1, showed the most homology, i.e., about 30% identity, between SEQ ID NO: 1 1 
0 and Leptinotarsa decemlineata acetylcholinesterase. 
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Translation of SEQ ID NO:57 suggests that nucleic acid molecule nfE5 2144 
encodes a full-length arthropod esterase protein of about 550 amino acids, referred to 
herein as PfE5 5 5o, represented by SEQ ID NO:58, assuming an open reading frame in 
which the initiation codon spans from nucleotide 30 through nucleotide 32 of SEQ ID 
5 NO:57 and the termination (stop) codon spans from nucleotide 1680 through nucleotide 
1682 of SEQ ID NO:57. The complement of SEQ ID NO:57 is represented herein by 
SEQ ID NO:59. The coding region encoding Pf£5 550 is represented by the nucleic acid 
molecule nfE5 1650 , having a coding strand with the nucleic acid sequence represented by 
SEQ ID NO:60 and a complementary strand with nucleic acid sequence SEQ ID NO:61 . 
10 The deduced amino acid sequence of PfE5 550 (i.e., SEQ ID NO:58) predicts that PfE5 550 
has an estimated molecular weight of about 61.8 kD and an estimated pi of about 5.5. 

Comparison of amino acid sequence SEQ ID NO:58 (i.e., the amino acid 
sequence of PfE5 550 ) with amino acid sequences reported in GenBank indicates that SEQ 
ID NO:58 showed the most homology, i.e., about 36% identity between SEQ ID NO:58 
1 5 and Drosophila melanogaster alpha esterase protein. 

Translation of SEQ ID NO: 18 suggests that nucleic acid molecule nfE6 179 , 
encodes a full-length arthropod esterase protein of about 550 amino acids, referred to 
herein as PfE6 550 , represented by SEQ ID NO: 19, assuming an open reading frame 
having an initiation codon spanning from nucleotide 49 through nucleotide 51 of SEQ 
20 ID NO: 18 and a stop codon spanning from nucleotide 1699 through nucleotide 1 701 of 
SEQ ID NO: 1 8. The coding region encoding PfE6 550 , is represented by nucleic acid 
molecule nfE6 I650 , having a coding strand with the nucleic acid sequence represented by 
SEQ ID NO:21 and a complementary strand with nucleic acid sequence SEQ ID NO:22. 
The proposed mature protein, denoted herein as PfE6 530 , contains about 530 amino acids 
25 which is represented herein as SEQ ID NO:5 J. The nucleic acid molecule encoding 
PfE6 530 is denoted herein as nf£6 1590 and has a coding strand having the nucleic acid 
sequence SEQ ID NO:23. The deduced amino acid sequence SEQ ID NO:19 suggests a 
protein having a molecular weight of about 61.8 kD and an estimated pi of about 5.5. 
Comparison of amino acid sequence SEQ ID NO:19 (i.e., the amino acid 
30 sequence of PfE6 550 ) with amino acid sequences reported in GenBank indicates that SEQ 
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j> showed the most homology, i.e., about 28% identity between SEQ ID NO: 19 
bphila melanogaster alpha esterase protein. 

ranslation of SEQ ID NO:24 suggests that nucleic acid molecule nfE7 2g36 
i full-length arthropod esterase protein of about 596 amino acids, referred to 
PfE7 5% , represented by SEQ ID NO:25, assuming an open reading frame 
l initiation codon spanning from nucleotide 99 through nucleotide 101 of SEQ 
4 and a stop codon spanning from nucleotide 1887 through nucleotide 1889 of 
NO:24. The coding region encoding PfE7 596 , is represented by nucleic acid 
j. nfE7, 7SS , having a coding strand with the nucleic acid sequence represented by 
NO:28 and a complementary strand with nucleic acid sequence SEQ ID NO:29. 
iosed mature protein, denoted herein as PfE7 570 , contains about 570 amino acids 
represented herein as SEQ ID NO:54. The nucleic acid molecule encoding 
s denoted herein as nfE7 1710 and has a coding strand having the nucleic acid 

SEQ ID NO:27. The deduced amino acid sequence SEQ ID NO:25 suggests a 
laving a molecular weight of about 68.7 kD and an estimated pi of about 6.1. 
:omparison of amino acid sequence SEQ ID NO:25 (i.e., the amino acid 
e of PfE7 5% ) with amino acid sequences reported in GenBank indicates that SEQ 
15 showed the most homology, i.e., about 27% identity between SEQ ID NO:25 
>sophiIa melanogaster alpha esterase protein. 

Translation of SEQ ID NO:30 suggests that nucleic acid molecule nfE8 2S01 
» a full-length arthropod esterase protein of about 595 amino acids, referred to 
ts PfE8 595 , represented by SEQ ID NO:31, assuming an open reading frame 
an initiation codon spanning from nucleotide 99 through nucleotide 101 of SEQ 
30 and a stop codon spanning from nucleotide 1884 through nucleotide 1886 of 
) NO- 30. The coding region encoding PfE8 595 , is represented by nucleic acid 
Ie nfE8, 785 , having a coding strand with the nucleic acid sequence represented by 
NO:34 and a complementary strand with nucleic acid sequence SEQ ID NO:35. 
Dposed mature protein, denoted herein as PfE8 570 , contains about 570 amino acids 
is represented herein as SEQ ID NO:55. The nucleic acid molecule encoding 
, is denoted herein as nfE8 1710 and has a coding strand having the nucleic acid 
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sequence SEQ ID NO:33. The deduced amino acid sequence SEQ ID NO:31 suggests a 
protein having a molecular weight of about 68.6 kD and an estimated pi of about 6.1. 

Comparison of amino acid sequence SEQ ID NO:31 (i.e., the amino acid 
sequence of PfE8 595 ) with amino acid sequences reported in GenBank indicates that SEQ 
5 ID NO:31 showed the most homology, i.e., about 28% identity between SEQ ID NO:31 
and estalpha-2 esterase of Culex pipiens quinquefasciatus. 

Translation of SEQ ID NO:36 suggests that nucleic acid molecule nfE9 2007 
encodes a full-length arthropod esterase protein of about 528 amino acids, referred to 
herein as PfE9 528 , represented by SEQ ID NO:37, assuming an open reading frame 
10 having an initiation codon spanning from nucleotide 1 1 through nucleotide 13 of SEQ 
ID NO:36 and a stop codon spanning from nucleotide 1595 through nucleotide 1597 of 
SEQ ID NO:36. The coding region encoding PfE9 528 , is represented by nucleic acid 
molecule nfE9 15S4 , having a coding strand with the nucleic acid sequence represented by 
SEQ ID NO:51 and a complementary strand with nucleic acid sequence SEQ ID NO:52. 
1 5 The deduced amino acid sequence SEQ ID NO:37 suggests a protein having a molecular 
weight of about 60 kD and an estimated pi of about 5.43. 

Comparison of amino acid sequence SEQ ID NO:37 (i.e., the amino acid 
sequence of PfE9 528 ) with amino acid sequences reported in GenBank indicates that SEQ 
ID NO:37 showed the most homology, i.e., about 37% identity between SEQ ID NO:37 
20 and alpha esterase protein from Drosophila melanogaster. 

Translation of SEQ ID NO:67 suggests that nucleic acid molecule nfE10 l9S7 
encodes a full-length flea esterase protein of about 530 amino acids, referred to herein as 
PfE10 530 , having amino acid sequence SEQ ID NO:68, assuming an open reading frame 
in which the initiation codon spans from nucleotide 231 through nucleotide 233 of SEQ 
25 ID NO:67 and a stop codon spanning from aucleotide 1821 through nucleotide 1823 of 
SEQ ID NO:67. The complement of SEQ ID NO:67 is represented herein by SEQ ID 
NO:69. The coding region encoding PfE10 S30 , is represented by nucleic acid molecule 
nfE10 1590 , having a coding strand with the nucleic acid sequence represented by SEQ ID 
NO:70 and a complementary strand with nucleic acid sequence SEQ ID NO:71 . The 
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amino acid sequence of PfE10 530 (i.e., SEQ ID NO:68) predicts that PfE10 530 has an 
estimated molecular weight of about 59.5 kD and an estimated pi of about 5.5. 

Comparison of amino acid sequence SEQ ID NO:68 (i.e., the amino acid 
sequence of PfE10 530 ) with amino acid sequences reported in GenBank indicates that 
5 SEQ ID NO:68 showed the most homology, i.e., about 30% identity between SEQ ID 
NO:68 and Culexpipens esterase bl precurser protein (swissprot # P16854). 

More preferred arthropod esterase proteins of the present invention include 
proteins comprising amino acid sequences that are at least about 40%, preferably at least 
about 45%, more preferably at least about 50%, even more preferably at least about 
1 0 5 5%, even more preferably at least about 60%, even more preferably at least about 70%, 
even more preferably at least about 80%, even more preferably at least about 90%, and 
even more preferably at least about 95%, identical to amino acid sequence SEQ ID 
NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:ll, SEQ ID NO:14, SEQ ID NO:19, 
SEQ ID NO:25, SEQ ID NO:31, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ 
1 5 ID NO:41 , SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:53, SEQ ID 
NO:54, SEQ ID NO:55, SEQ ID NO:58, SEQ ID NO:68, SEQ ID NO:73 and/or SEQ ID 
NO:74. 

More preferred arthropod esterase proteins of the present invention include 
proteins encoded by a nucleic acid molecule comprising at least a portion of nfEl 401 , 
20 nfE2 364 , nfE3 421 , nfE4 524 , nfE5 1982 , nfE5 1515 , nfE5 2144 , nfE5 1050 , nfE6 14S5 , nfE6 1792 , nfE6 1050 , 
nfE7 2S36 , nfE7 1788 , nf£7 1710 , nfE7 650 , nfE8 2801 , nfE8 1785 , nfE8 1710 , nfE9 2007 , nfE9 1584 , 
nf£9 1540 , nfE10 1987 and/or nf£10 1590 , or of allelic variants of such nucleic acid molecules. 
More preferred is an esterase protein encoded by nfEl 40 i, nfE2 364 , nfE3 421 , nfE4 524 , 
nfE5 1982 , nfE5 I515 , nfE5 2144 , nfE5 I650 , nfE6 1488 , nfE6 1792 , nfE6 1650 , nfE7 2836 , nfE7 178S , 
25 nfE7 I710 , n.E- 650 , nfE8 2801 , nfE8 1785 , nfE8 17I0 , nfE9 2007 , nfE9 I584 , nfF.9 1540 , nfE10 19 - 7 and/or 
niE10 1590 , or by an allelic variant of such nucleic acid molecules. Particularly preferred 
arthropod esterase proteins are PfEl 103 , PfE2 12 „ PfE3 103 , PfE4 137 , PfE5 505 , PfE5 550 , 
PfE6 550 , PfE6 530 , PfE7 596 , PfE7 570 , PfE8 595 , PfE8 s70 , PfE9 528 and PfE10 530 . 

In one embodiment, a preferred esterase protein of the present invention is 
30 encoded by at least a portion of SEQ ID NO:l, SEQ ID NO:4, SEQ ID NO:7, SEQ ID 
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NO: 10, SEQ ID NO: 13, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:21, SEQ ID 
NO:23, SEQ ID NO:24, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID N0:51, SEQ ID NO:57, SEQ ID 
NO:60 and/or SEQ ID NO:67, and, as such, has an amino acid sequence that includes at 
least a portion of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID 
NO:14, SEQ ID NO: 19, SEQ ID NO:25, SEQ ID NO:31, SEQ ID NO:37, SEQ ID 
NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:58 and/or SEQ ID NO:68. Also 
preferred is a protein encoded by an allelic variant of a nucleic acid molecule comprising 
at least a portion of the above-listed nucleic acid sequences. 

Particularly preferred esterase proteins of the present invention include SEQ ID 
NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NO: 14, SEQ ID NO:19, 
SEQ ID NO:25, SEQ ID NO:31, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ 
ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:53, SEQ ID 
NO:54, SEQ ID NO:55, SEQ ID NO:58, SEQ ID NO:68, SEQ ID NO:73 and/or SEQ ID 
NO:74. (including, but not limited to, the proteins consisting of such sequences, fusion 
proteins and multivalent proteins) and proteins encoded by allelic variants of SEQ ID 
NO:l, SEQ ID NO:4, SEQ ID NO:7, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:16, 
SEQ ID NO:18, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:27, SEQ 
ID NO:28, SEQ ID NO:30, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID 
NO:51, SEQ ID NO:57, SEQ ID NO:60 and/or SEQ ID NO:67. 

Another embodiment of the present invention is an isolated nucleic acid molecule 
that hybridizes under stringent hybridization conditions with a C.felis esterase gene. 
The identifying characteristics of such a gene are heretofore described. A nucleic acid 
molecule of the present invention can include an isolated natural arthropod esterase gene 
or a homolog thereof, the latter ef v hich is described in more detail below. A nucleic 
acid molecule of the present invention can include one or more regulatory regions, full- 
length or partial coding regions, or combinations thereof. The minimal size of a nucleic 
acid molecule of the present invention is the minimal size that can form a stable hybrid 
with a C.felis esterase gene under stringent hybridization conditions. 
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In accordance with the present invention, an isolated nucleic acid molecule is a 
nucleic acid molecule that has been removed from its natural milieu (i.e., that has been 
subject to human manipulation) and can include DNA, RNA, or derivatives of either 
DNA or RNA. As such, "isolated" does not reflect the extent to which the nucleic acid 
5 molecule has been purified. An isolated arthropod esterase nucleic acid molecule of the 
present invention can be isolated from its natural source or can be produced using 
recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, 
cloning) or chemical synthesis. Isolated esterase nucleic acid molecules can include, for 
example, natural allelic variants and nucleic acid molecules modified by nucleotide 
1 0 insertions, deletions, substitutions, and/or inversions in a manner such that the 

modifications do not substantially interfere with the nucleic acid molecule's ability to 
encode an esterase protein of the present invention or to form stable hybrids under 
stringent conditions with natural gene isolates. 

An arthropod esterase nucleic acid molecule homolog can be produced using a 
15 number of methods known to those skilled in the art (see, for example, Sambrook et al., 
ibid.). For example, nucleic acid molecules can be modified using a variety of 
techniques including, but not limited to, classic mutagenesis and recombinant DNA 
techniques (e.g., site-directed mutagenesis, chemical treatment, restriction enzyme 
cleavage, ligation of nucleic acid fragments and/or PCR amplification), synthesis of 
20 oligonucleotide mixtures and ligation of mixture groups to "build" a mixture of nucleic 
acid molecules and combinations thereof. Nucleic acid molecule homologs can be 
selected by hybridization with a C.felis esterase gene or by screening for the function of 
a protein encoded by the nucleic acid molecule (e.g., ability to elicit an immune response 
against at least one epitope of an arthropod esterase protein, hydrolyze cc-napthyl acetate, 
25 hvdr jlyze the methyl ester group of juvenile hormone and/or bmd to DFP). 

An isolated nucleic acid molecule of the present invention can include a nucleic 
acid sequence that encodes at least one arthropod esterase protein of the present 
invention, examples of such proteins being disclosed herein. Although the phrase 
"nucleic acid molecule" primarily refers to the physical nucleic acid molecule and the 
30 phrase "nucleic acid sequence" primarily refers to the sequence of nucleotides on the 



-33- 



FC-1-C1-PCT 



nucleic acid molecule, the two phrases can be used interchangeably, especially with 
respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding 
an arthropod esterase protein. 

A preferred nucleic acid molecule of the present invention, when administered to 
5 an animal, is capable of protecting that animal from infestation by a hematophagous 
ectoparasite. As will be disclosed in more detail below, such a nucleic acid molecule 
can be, or can encode, an antisense RNA, a molecule capable of triple helix formation, a 
ribozyme, or other nucleic acid-based drug compound. In additional embodiments, a 
nucleic acid molecule of the present invention can encode a protective esterase protein 
1 0 (e.g., an esterase protein of the present invention), the nucleic acid molecule being 

delivered to the animal, for example, by direct injection (i.e, as a naked nucleic acid) or 
in a vehicle such as a recombinant virus vaccine or a recombinant cell vaccine. 

One embodiment of the present invention is an esterase nucleic acid molecule 
that hybridizes under stringent hybridization conditions with nucleic acid molecule 
15 nfEUi and preferably with a nucleic acid molecule having nucleic acid sequence SEQ 

ID NO:l and/or SEQ ID NO:3. 

Another embodiment of the present invention is an esterase nucleic acid 
molecule that hybridizes under stringent hybridization conditions with nucleic acid 
molecule nfE2 364 and preferably with a nucleic acid molecule having nucleic acid 
20 sequence SEQ ID NO:4 and/or SEQ ID NO:6. 

Another embodiment of the present invention is an esterase nucleic acid 
molecule that hybridizes under stringent hybridization conditions with nucleic acid 
molecule nfE3 421 and preferably with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO:7 and/or SEQ ID NO:9. 
25 Another embodiment of present invention is an esterase nucleic acid 

molecule that hybridizes under stringent hybridization conditions with nucleic acid 
molecule nfE4 524 and preferably with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO:10 and/or SEQ ID NO:12. 

Another embodiment of the present invention is an esterase nucleic acid 
30 molecule that hybridizes under stringent hybridization conditions with nucleic acid 
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molecule nfE5 2l44 and preferably with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO:57 and/or SEQ ID NO:59. 

Another embodiment of the present invention is an esterase nucleic acid 
molecule that hybridizes under stringent hybridization conditions with nucleic acid 
molecule nfE6 1792 and preferably with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO: 1 8 and/or SEQ ID NO:20. 

Another embodiment of the present invention is an esterase nucleic acid 
molecule that hybridizes under stringent hybridization conditions with nucleic acid 
molecule nfE7 2836 and preferably with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO:24 and/or SEQ ID NO:26. 

Another embodiment of the present invention is an esterase nucleic acid 
molecule that hybridizes under stringent hybridization conditions with nucleic acid 
molecule nfE8 2801 and preferably with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO:30 and/or SEQ ID NO:32. 

Another embodiment of the present invention is an esterase nucleic acid 
molecule that hybridizes under stringent hybridization conditions with nucleic acid 
molecule nfE9 2007 and preferably with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO:36 and/or SEQ ID NO:38. 

Another embodiment of the present invention is an esterase nucleic acid 
molecule that hybridizes under stringent hybridization conditions with nucleic acid 
molecule nfE10 1987 and preferably with a nucleic acid molecule having nucleic acid 
sequence SEQ ID NO:67 and/or SEQ ID NO:69. 

Comparison of nucleic acid sequence SEQ ID NO:l (i.e., the nucleic acid 
sequence of nfEl 401 ) with nucleic acid sequences reported in GenBank indicates that 
> SLQ ID NO:l showed no identifiable identity with anv sequence reported in GenBank. 

Comparison of nucleic acid sequence SEQ ID NO:4 (i.e., the coding strand of 
nucleic acid sequence of nfE2 364 ) with nucleic acid sequences reported in GenBank 
indicates that SEQ ID NO:4 showed the most homolog, i.e., about 43% identity, between 
SEQ ID NO:4 and a H. virescens juvenile hormone esterase gene. 
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Comparison of nucleic acid sequence SEQ ID NO:7 (i.e., the coding strand of 
nucleic acid sequence of nfE3 421 ) with nucleic acid sequences reported in GenBank 
indicates that SEQ ID NO:7 showed the most homolog, i.e., about 53% identity, between 
SEQ ID NO:7 and a Torpedo marmorata acetylcholinesterase gene. 
5 Comparison of nucleic acid sequence SEQ ID NO:10 (i.e., the coding strand of 

nucleic acid sequence of nfE4 524 ) with nucleic acid sequences reported in GenBank 
indicates that SEQ ID NO:10 showed the most homolog, i.e., about 47% identity, 
between SEQ ID NO: 10 and an Anas platyrhyncos thioesterase B gene. 

Comparison of nucleic acid sequence SEQ ID NO:57 (i.e., the coding strand of 
10 nucleic acid sequence of nf£5 2144 ) with nucleic acid sequences reported in GenBank 
indicates that SEQ ID NO:57 showed the most homolog, i.e., about 41% identity, 
between SEQ ID NO:57 and a esterase mRNA from Myzus persicae. 

Comparison of nucleic acid sequence SEQ ID NO:18 (i.e., the coding strand of 
nucleic acid sequence of nfE6 1V92 ) with nucleic acid sequences reported in GenBank 
15 indicates that SEQ ID NO:18 showed the most homolog, i.e., about 41% identity, 
between SEQ ID NO: 18 and a esterase gene from Myzus persicae. 

Comparison of nucleic acid sequence SEQ ID NO:24 (i.e., the coding strand of 
nucleic acid sequence of nfE7 2836 ) with nucleic acid sequences reported in GenBank 
indicates that SEQ ID NO:24 showed the most homolog, i.e., about 48% identity, 
20 between SEQ ID NO:24 and an Anas platychyncos thioesterase B gene. 

Comparison of nucleic acid sequence SEQ ID NO:30 (i.e., the coding strand of 
nucleic acid sequence of nfE8 2g01 ) with nucleic acid sequences reported in GenBank 
indicates that SEQ ID NO:30 showed the most homolog, i.e., about 46% identity, 
between SEQ ID NO:30 and a Mus musculus carboxyl ester lipase gene. 
25 Comparison of nucleic acid sequence SEQ ID NO:36 (i.e., the coding strand ox 

nucleic acid sequence of nfE9 20 o 7 ) with nucleic acid sequences reported in GenBank 
indicates that SEQ ID NO:36 showed the most homolog, i.e., about 47% identity, 
between SEQ ID NO:36 and a hamster mRNA for CE precursor gene. 

Comparison of nucleic acid sequence SEQ ID NO:67 with nucleic acid 
30 sequences reported in GenBank indicates that SEQ ID NO:67 showed the most 
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homology, i.e., about 48% identity, between SEQ ID NO:67 and a Lucilia cuprina alpha 
esterase gene (genembl # U56636) gene. 

Preferred arthropod esterase nucleic acid molecules include nucleic acid 
molecules having a nucleic acid sequence that is at least about 55%, preferably at least 
about 60%, more preferably at least about 65%, more preferably at least about 70%, 
more preferably at least about 75%, more preferably at least about 80%, more preferably 
at least about 90%, and even more preferably at least about 95% identical to nucleic acid 
sequence SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, 
SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO: 15, SEQ 
ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:21, SEQ ID 
NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:33, SEQ ID 
NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:51, SEQ ID 
NO:52, SEQ ID NO:57, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID 
NO:67, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID 
NO:76 and/or a nucleic acid molecule encoding a protein comprising amino acid 

sequence SEQ ID NO:74. 

Another preferred nucleic acid molecule of the present invention includes at least 
a portion of nucleic acid sequence SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, 
SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO:20, SEQ 
ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:57, SEQ ID NO:59, SEQ ID NO:60, SEQ ID 
NO:61, SEQ ID NO:67, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID 
NO:72, SEQ ID NO:76 and/or a nucleic acid molecule encoding a protein comprising 
amino acid sequence SEQ ID NO:74, that is capable of hybridizing to a C.felis esterase 
gene of the present invention, as well as allelic variants thereof. A more preferred 
nucleic acid molecule includes the nucleic acid sequence SEQ ID NO:l, SEQ ID NO:3, 
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SEQ ID N0:4, SEQ ID N0:6, SEQ ID NO:7, SEQ ID N0:9, SEQ ID NO: 10, SEQ ID 
NO:12, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID 
NO:18, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
5 NO:30, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:38, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:57, SEQ ID 
NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:67, SEQ ID NO:69, SEQ ID 
NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:76 and/or a nucleic acid molecule 
encoding a protein comprising amino acid sequence SEQ ID NO:74, as well as allelic 
10 variants thereof. Such nucleic acid molecules can include nucleotides in addition to 

those included in the SEQ ID NOs, such as, but not limited to, a full-length gene, a full- 
length coding region, a nucleic acid molecule encoding a fusion protein, or a nucleic acid 
molecule encoding a multivalent protective compound. Particularly preferred nucleic 
acid molecules include nfEl 40l , nfE2 364 , nfE3 42 „ nfE4 524 , nfE5 1982 , nfE5 1515 , nfE5 2144 , 
15 nfE5 I650 , nfE6 148S , nfE6 1792 , nfE6 1650 , nfE7 2836 , nfE7 178g> nfE7 lvl0 , nfE7 650 , nfE8 2S01 , 
nfE8 1785 , nfE8, 710 , nfE9 200 7, nfE9 1584 , nfE9 1540 , nfE10 19S7 and nfE10 1590 . 

The present invention also includes a nucleic acid molecule encoding a protein 
having at least a portion of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO.S, SEQ ID 
NO:ll, SEQ ID NO:14, SEQ ID NO:19, SEQ ID NO:25, SEQ ID NO:31, SEQ ID 
20 NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID 
NO:43, SEQ ID NO:44, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID 
NO:58, SEQ ID NO:68, SEQ ID NO:73 and/or SEQ ID NO:74, including nucleic acid 
molecules that have been modified to accommodate codon usage properties of the cells 
in which such nucleic acid molecules are to be expressed. 
25 Knowing the nu„l ,ic acid sequences of certain arthropod esterase nuclei c acid 

molecules of the present invention allows one skilled in the art to, for example, (a) make 
copies of those nucleic acid molecules, (b) obtain nucleic acid molecules including at 
least a portion of such nucleic acid molecules (e.g., nucleic acid molecules including 
full-length genes, full-length coding regions, regulatory control sequences, truncated 
30 coding regions), and (c) obtain esterase nucleic acid molecules from other arthropods. 



-38- 



FC-1-C1-PCT 



Such nucleic acid molecules can be obtained in a variety of ways including screening 
appropriate expression libraries with antibodies of the present invention; traditional 
cloning techniques using oligonucleotide probes of the present invention to screen 
appropriate libraries or DNA; and PCR amplification of appropriate libraries or DNA 
5 using oligonucleotide primers of the present invention. Preferred libraries to screen or 
from which to amplify nucleic acid molecule include flea pre-pupal, 3rd instar or adult 
cDNA libraries as well as genomic DNA libraries. Similarly, preferred DNA sources to 
screen or from which to amplify nucleic acid molecules include flea pre-pupal, 3rd instar 
or adult cDNA and genomic DNA. Techniques to clone and amplify genes are 
1 0 disclosed, for example, in Sambrook et al., ibid. 

The present invention also includes nucleic acid molecules that are 
oligonucleotides capable of hybridizing, under stringent hybridization conditions, with 
complementary regions of other, preferably longer, nucleic acid molecules of the present 
invention such as those comprising arthropod esterase genes or other arthropod esterase 
nucleic acid molecules. Oligonucleotides of the present invention can be RNA, DNA, or 
derivatives of either. The minimum size of such oligonucleotides is the size required for 
formation of a stable hybrid between an oligonucleotide and a complementary sequence 
on a nucleic acid molecule of the present invention. Minimal size characteristics are 
disclosed herein. The present invention includes oligonucleotides that can be used as, 
for example, probes to identify nucleic acid molecules, primers to produce nucleic acid 
molecules or therapeutic reagents to inhibit esterase protein production or activity (e.g., 
as antisense-, triplex formation-, ribozyme- and/or RNA drug-based reagents). The 
present invention also includes the use of such oligonucleotides to protect animals from 
disease using one or more of such technologies. Appropriate oligonucleotide-containing 
2 > therapeutic compositions can be administered to an animal u,ing techniques known to 

those skilled in the art. 

One embodiment of the present invention includes a recombinant vector, which 
includes at least one isolated nucleic acid molecule of the present invention, inserted into 
any vector capable of delivering the nucleic acid molecule into a host cell. Such a vector 
30 contains heterologous nucleic acid sequences, that is nucleic acid sequences that are not 
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naturally found adjacent to nucleic acid molecules of the present invention and that 
preferably are derived from a species other than the species from which the nucleic acid 
molecule(s) are derived. The vector can be either RNA or DNA, either prokaryotic or 
eukaryotic, and typically is a virus or a plasmid. Recombinant vectors can be used in the 
5 cloning, sequencing, and/or otherwise manipulation of arthropod esterase nucleic acid 
molecules of the present invention. 

One type of recombinant vector, referred to herein as a recombinant molecule, 
comprises a nucleic acid molecule of the present invention operatively linked to an 
expression vector. The phrase operatively linked refers to insertion of a nucleic acid 
1 0 molecule into an expression vector in a manner such that the molecule is able to be 
expressed when transformed into a host cell. As used herein, an expression vector is a 
DNA or RNA vector that is capable of transforming a host cell and of effecting 
expression of a specified nucleic acid molecule. Preferably, the expression vector is also 
capable of replicating within the host cell. Expression vectors can be either prokaryotic 
15 or eukaryotic, and are typically viruses or plasmids. Expression vectors of the present 
invention include any vectors that function (i.e., direct gene expression) in recombinant 
cells of the present invention, including in bacterial, fungal, endoparasite, insect, other 
animal, and plant cells. Preferred expression vectors of the present invention can direct 
gene expression in bacterial, yeast, insect and mammalian cells and more preferably in 
20 the cell types disclosed herein. 

In particular, expression vectors of the present invention contain regulatory 
sequences such as transcription control sequences, translation control sequences, origins 
of replication, and other regulatory sequences that are compatible with the recombinant 
cell and that control the expression of nucleic acid molecules of the present invention. 
25 In particular, recombinant molecules of the present invention include transci ipaon 
control sequences. Transcription control sequences are sequences which control the 
initiation, elongation, and termination of transcription. Particularly important 
transcription control sequences are those which control transcription initiation, such as 
promoter, enhancer, operator and repressor sequences. Suitable transcription control 
30 sequences include any transcription control sequence that can function in at least one of 
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the recombinant cells of the present invention. A variety of such transcription control 
sequences are known to those skilled in the art. Preferred transcription control 
sequences include those which function in bacterial, yeast, insect and mammalian cells, 
such as, but not limited to, tac, lac, trp, trc, oxy-pro, omp/lpp, rmB, bacteriophage 

5 lambda(such as lambda p L and lambda p R and fusions that include such promoters), 
bacteriophage T7, lilac, bacteriophage T3, bacteriophage SP6, bacteriophage SP01, 
metallothionein, alpha-mating factor, Pichia alcohol oxidase, alphavirus subgenomic 
promoters (such as Sindbis virus subgenomic promoters), antibiotic resistance gene, 
baculovirus, Heliothis zea insect virus, vaccinia virus, herpesvirus, raccoon poxvirus, 

1 0 other poxvirus, adenovirus, cytomegalovirus (such as intermediate early promoters), 
simian virus 40, retrovirus, actin, retroviral long terminal repeat, Rous sarcoma virus, 
heat shock, phosphate and nitrate transcription control sequences as well as other 
sequences capable of controlling gene expression in prokaryotic or eukaryotic cells. 
Additional suitable transcription control sequences include tissue-specific promoters and 

1 5 enhancers as well as lymphokine-inducible promoters (e.g., promoters inducible by 

interferons or interleukins). Transcription control sequences of the present invention can 
also include naturally occurring transcription control sequences naturally associated with 

arthropods, such as, C.felis. 

Suitable and preferred nucleic acid molecules to include in recombinant vectors 

20 of the present invention are as disclosed herein. Preferred nucleic acid molecules to 
include in recombinant vectors, and particularly in recombinant molecules, include 
nfEl 40 „ nfE2 364 , nfE3 421 , nfE4 524 , nfE5 I982 , nfE5 I515 , nfE5 2144 , nfE5 1650 , nfE6 I4S8 , nfE6 1792 , 
nfE6 1650 , nfE7 2836 , nfE7 1788 , nfE7 1710 , nffi7 050 , nfE8 2801 , n£E8 1785 , nfE8 1710 , nfE9 2007 , 
nf£9 1584 , nfE9 1540 , nfE10 1987 and/or nfE10 1590 . Particularly preferred recombinant 

25 molecules of the present invention include pLro-nffi6 1488 , pTrc-nfE7 650 , pTrc-nfE7 1710) 
P Trc-nfE8 1710 . P Trc-nfE5 1650 , P Trc-nfE9 1540 , pFB-nfE6 1079 , P VL-nfE7 1802 , pVL-fE8 1792 and 
P VL-nfE9 160 o, the production of which are described in the Examples section. 

Recombinant molecules of the present invention may also (a) contain secretory 
signals (i.e., signal segment nucleic acid sequences) to enable an expressed arthropod 

30 protein of the present invention to be secreted from the cell that produces the protein 
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and/or (b) contain fusion sequences which lead to the expression of nucleic acid 
molecules of the present invention as fusion proteins. Examples of suitable signal 
segments include any signal segment capable of directing the secretion of a protein of 
the present invention. Preferred signal segments include, but are not limited to, tissue 
plasminogen activator (t-PA), interferon, interleukin, growth hormone, 
histocompatibility and viral envelope glycoprotein signal segments, as well as natural 
signal sequences. Suitable fusion segments encoded by fusion segment nucleic acids are 
disclosed herein. In addition, a nucleic acid molecule of the present invention can be 
joined to a fusion segment that directs the encoded protein to the proteosome, such as a 
ubiquitin fusion segment. Recombinant molecules may also include intervening and/or 
untranslated sequences surrounding and/or within the nucleic acid sequences of nucleic 
acid molecules of the present invention. 

Another embodiment of the present invention includes a recombinant cell 
comprising a host cell transformed with one or more recombinant molecules of the 
present invention. Transformation of a nucleic acid molecule into a cell can be 
accomplished by any method by which a nucleic acid molecule can be inserted into the 
cell. Transformation techniques include, but are not limited to, transfection, 
electroporation, microinjection, lipofection, adsorption, and protoplast fusion. A 
recombinant cell may remain unicellular or may grow into a tissue, organ or a 
multicellular organism. Transformed nucleic acid molecules of the present invention 
can remain extrachromosomal or can integrate into one or more sites within a 
chromosome of the transformed (i.e., recombinant) cell in such a manner that their 
ability to be expressed is retained. Preferred nucleic acid molecules with which to 
transform a cell include arthropod esterase nucleic acid molecules disclosed herein. 
I Particularly pr.fured nucleic acid molecules with which to transform a cell include 
nfEU,. nfE2 3M , nfE3«„ nfE4 524 , nfE5 1982 , nfE5 1515 , nfE5 214 „ nfE5 1650 , nfE6 14S8 , nfE6 1792 , 
nfE6 1650 , nfE7 2830 , nfE7 1788 , nfE7 17l0 , nfE7 650 , nffiW nfE8 1785 , nffl8 I71Q> nfE9 2007 , 
nfE9 1584 , nfE9 1S 4o, nfE10, 987 and/or nfE10 1590 . 

Suitable host cells to transform include any cell that can be transformed with a 
0 nucleic acid molecule of the present invention. Host cells can be either untransformed 
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cells or cells that are already transformed with at least one nucleic acid molecule (e.g., 
nucleic acid molecules encoding one or more proteins of the present invention and/or 
other proteins useful in the production of multivalent vaccines). Host cells of the present 
invention either can be endogenously (i.e., naturally) capable of producing arthropod 
esterase proteins of the present invention or can be capable of producing such proteins 
after being transformed with at least one nucleic acid molecule of the present invention. 
Host cells of the present invention can be any cell capable of producing at least one 
protein of the present invention, and include bacterial, fungal (including yeast), parasite, 
other insect, other animal and plant cells. Preferred host cells include bacterial, 
mycobacterial, yeast, insect and mammalian cells. More preferred host cells include 
Salmonella, Escherichia, Bacillus, Listeria, Saccharomyces, Spodoptera, Mycobacteria, 
Trichoplusia, BHK (baby hamster kidney) cells, MDCK cells (normal dog kidney cell 
line for canine herpesvirus cultivation), CRFK cells (normal cat kidney cell line for 
feline herpesvirus cultivation), CV-1 cells (African monkey kidney cell line used, for 
example, to culture raccoon poxvirus), COS (e.g., COS-7) cells, and Vero cells. 
Particularly preferred host cells are Escherichia coli, including E. coli K-12 derivatives; 
Salmonella typhi; Salmonella typhimurium, including attenuated strains such as UK-1 
x 3987 and SR-U x 4072; Spodoptera frugiperda; Trichoplusia ni; BHK cells; MDCK 
cells; CRFK cells; CV-1 cells; COS cells; Vero cells; and non-tumorigenic mouse 
myoblast G8 cells (e.g., ATCC CRL 1246). Additional appropriate mammalian cell 
hosts include other kidney cell lines, other fibroblast cell lines (e.g., human, murine or 
chicken embryo fibroblast cell lines), myeloma cell lines, Chinese hamster ovary cells, 
mouse NIH/3T3 cells, LMTK 31 cells and/or HeLa cells. In one embodiment, the 
proteins may be expressed as heterologous proteins in myeloma cell lines employing 
immunoglobulin promoters. 

A recombinant cell is preferably produced by transforming a host cell with one or 
more recombinant molecules, each comprising one or more nucleic acid molecules of the 
present invention operatively linked to an expression vector containing one or more 
transcription control sequences. The phrase operatively linked refers to insertion of a 
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nucleic acid molecule into an expression vector in a manner such that the molecule is 
able to be expressed when transformed into a host cell. 

A recombinant molecule of the present invention is a molecule that can include at 
least one of any nucleic acid molecule heretofore described operatively linked to at least 
5 one of any transcription control sequence capable of effectively regulating expression of 
the nucleic acid molecule(s) in the cell to be transformed, examples of which are 
disclosed herein. Particularly preferred recombinant molecules include P Cro-nfE6 14SS , 
P Trc-nfE7 650 , P Trc-nfE7 lvl0) pTrc-nfE8 17l0 . P Trc-niE5 1650 , P Trc-nfE9 1540 , P FB-nfE6 l679 , 
pVL-nfE7 lg02 , P VL-£E8 1792 and P VL-nfE9 1600 . 
10 A recombinant cell of the present invention includes any cell transformed with at 

least one of any nucleic acid molecule of the present invention. Suitable and preferred 
nucleic acid molecules as well as suitable and preferred recombinant molecules with 
which to transform cells are disclosed herein. Particularly preferred recombinant cells 
include E. CO //:pCro-nffi6, 4g8 , E. co//: P Trc-nfE7, 710> E c O //:pTrc-nfE7 650 , E. co//: P Trc- 
15 nfE8 I710 , E. co//: P Trc-nfE5 I650 , E. co//: P Trc-nfE9 1540 , S. fntgiperda : P VL-nfE7 ls02 , S. 
frugiperda:pVL-nfES liq2 , S. frugiperda:pVL-nfE9 1600 and S.frugiperda:p¥B-niE6 i(lV) . 
Details regarding the production of these recombinant cells are disclosed herein. 

Recombinant cells of the present invention can also be co-transformed with one 
or more recombinant molecules including arthropod esterase nucleic acid molecules 
20 encoding one or more proteins of the present invention and one or more other nucleic 
acid molecules encoding other protective compounds, as disclosed herein (e.g., to 

produce multivalent vaccines). 

Recombinant DNA technologies can be used to improve expression of 
transformed nucleic acid molecules by manipulating, for example, the number of copies 
25 of the nv cleic acid to >lecules within a host cell, the efficiency with which those nucleic 
acid molecules are transcribed, the efficiency with which the resultant transcripts are 
translated, and the efficiency of post-translational modifications. Recombinant 
techniques useful for increasing the expression of nucleic acid molecules of the present 
invention include, but are not limited to, operatively linking nucleic acid molecules to 
30 high-copy number plasmids, integration of the nucleic acid molecules into one or more 
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host cell chromosomes, addition of vector stability sequences to plasmids, substitutions 
or modifications of transcription control signals (e.g., promoters, operators, enhancers), 
substitutions or modifications of translational control signals (e.g., ribosome binding 
sites, Shine-Dalgamo sequences), modification of nucleic acid molecules of the present 
5 invention to correspond to the codon usage of the host cell, deletion of sequences that 
destabilize transcripts, and use of control signals that temporally separate recombinant 
cell growth from recombinant enzyme production during fermentation. The activity of 
an expressed recombinant protein of the present invention may be improved by 
fragmenting, modifying, or derivatizing nucleic acid molecules encoding such a protein. 
1 0 Isolated esterase proteins of the present invention can be produced in a variety of 

ways, including production and recovery of natural proteins, production and recovery of 
recombinant proteins, and chemical synthesis of the proteins. In one embodiment, an 
isolated protein of the present invention is produced by culturing a cell capable of 
expressing the protein under conditions effective to produce the protein, and recovering 
15 the protein. A preferred cell to culture is a recombinant cell of the present invention. 
Effective culture conditions include, but are not limited to, effective media, bioreactor, 
temperature, pH and oxygen conditions that permit protein production. An effective 
medium refers to any medium in which a cell is cultured to produce an arthropod 
esterase protein of the present invention. Such medium typically comprises an aqueous 
20 medium having assimilable carbon, nitrogen and phosphate sources, and appropriate 
salts, minerals, metals and other nutrients, such as vitamins. Cells of the present 
invention can be cultured in conventional fermentation bioreactors, shake flasks, test 
tubes, microliter dishes, and petri plates. Culturing can be carried out at a temperature, 
pH and oxygen content appropriate for a recombinant cell. Such culturing conditions are 
25 within the expertise of one of ordinary skill in the art. Examples of suitable conditions 
are included in the Examples section. 

Depending on the vector and host system used for production, resultant proteins 
of the present invention may either remain within the recombinant cell; be secreted into 
the fermentation medium; be secreted into a space between two cellular membranes, 
30 such as the periplasmic space in E. coli; or be retained on the outer surface of a cell or 
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viral membrane. The phrase "recovering the protein", as well as similar phrases, refers 
to collecting the whole fermentation medium containing the protein and need not imply 
additional steps of separation or purification. Proteins of the present invention can be 
purified using a variety of standard protein purification techniques, such as, but not 

5 limited to, affinity chromatography, ion exchange chromatography, filtration, 

electrophoresis, hydrophobic interaction chromatography, gel filtration chromatography, 
reverse phase chromatography, concanavalin A chromatography, chromatofocusing and 
differential solubilization. Proteins of the present invention are preferably retrieved in 
"substantially pure" form. As used herein, "substantially pure" refers to a purity that 

10 allows for the effective use of the protein as a therapeutic composition or diagnostic. A 
therapeutic composition for animals, for example, should exhibit no substantial toxicity 
and preferably should be capable of stimulating the production of antibodies in a treated 
animal. 

The present invention also includes isolated (i.e., removed from their natural 
1 5 milieu) antibodies that selectively bind to an arthropod esterase protein of the present 
invention or a mimetope thereof (i.e., anti-arthropod esterase antibodies). As used 
herein, the term "selectively binds to" an esterase protein refers to the ability of 
antibodies of the present invention to preferentially bind to specified proteins and 
mimetopes thereof of the present invention. Binding can be measured using a variety of 
20 methods standard in the art including enzyme immunoassays (e.g., ELISA), immunoblot 
assays, etc.; see, for example, Sambrook et al., ibid. An anti-arthropod esterase antibody 
preferably selectively binds to an arthropod esterase protein in such a way as to reduce 
the activity of that protein. 

Isolated antibodies of the present invention can include antibodies in a bodily 
25 fluid v „u ;h as, but not limited to, serum), or antibodies tha* hr.ve been purified to varying 
degrees. Antibodies of the present invention can be polyclonal or monoclonal, 
functional equivalents such as antibody fragments and genetically-engineered 
antibodies, including single chain antibodies or chimeric antibodies that can bind to 
more than one epitope. 
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A preferred method to produce antibodies of the present invention includes (a) 
administering to an animal an effective amount of a protein, peptide or mimetope thereof 
of the present invention to produce the antibodies and (b) recovering the antibodies. In 
another method, antibodies of the present invention are produced recombinantly using 
5 techniques as heretofore disclosed to produce arthropod esterase proteins of the present 
invention. Antibodies raised against defined proteins or mimetopes can be advantageous 
because such antibodies are not substantially contaminated with antibodies against other 
substances that might otherwise cause interference in a diagnostic assay or side effects if 
used in a therapeutic composition. 
1 0 Antibodies of the present invention have a variety of potential uses that are 

within the scope of the present invention. For example, such antibodies can be used (a) 
as therapeutic compounds to passively immunize an animal in order to protect the 
animal from arthropods susceptible to treatment by such antibodies and/or (b) as tools to 
screen expression libraries and/or to recover desired proteins of the present invention 
1 5 from a mixture of proteins and other contaminants. Furthermore, antibodies of the 

present invention can be used to target cytotoxic agents to hematophagous ectoparasites 
such as those discloses herein, in order to directly kill such hematophagous 
ectoparasites. Targeting can be accomplished by conjugating (i.e., stably joining) such 
antibodies to the cytotoxic agents using techniques known to those skilled in the art. 
20 Suitable cytotoxic agents are known to those skilled in the art. 

One embodiment of the present invention is a therapeutic composition that, when 
administered to an animal in an effective manner, is capable of protecting that animal 
from infestation by hematophagous ectoparasite. Therapeutic compositions of the 
present invention include at least one of the following protective compounds: an isolated 
25 hematophagous arthropod esterase protein (including a peptide); a mimetope of such a 
protein; an isolated nucleic acid molecule that hybridizes under stringent hybridization 
conditions with a Ctenocephalides felis esterase gene; an isolated antibody that 
selectively binds to an hematophagous arthropod esterase protein; and inhibitors of 
hematophagous arthropod esterase activity (including esterase substrate analogs). As 
30 used herein, a protective compound refers to a compound that, when administered to an 
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animal in an effective manner, is able to treat, ameliorate, and/or prevent disease caused 
by an arthropod of the present invention. Preferred arthropods to target are heretofore 
disclosed. Examples of proteins, nucleic acid molecules, antibodies and inhibitors of the 
present invention are disclosed herein. 
5 A preferred therapeutic composition of the present invention includes at least one 

of the following protective compounds: an isolated hematophagous ectoparasite 
carboxylesterase protein (including a peptide); a mimetope of such a protein; an isolated 
hematophagous ectoparasite carboxylesterase nucleic acid molecule that hybridizes 
under stringent hybridization conditions with a Ctenocephalides felis carboxylesterase 
1 0 gene; an isolated antibody that selectively binds to a hematophagous ectoparasite 

carboxylesterase protein; and an inhibitor of carboxylesterase activity identified by its 
ability to inhibit the activity of a flea carboxylesterase (including a substrate analog). 

Suitable inhibitors of esterase activity are compounds that interact directly with 
an esterase protein's active site, thereby inhibiting that esterase's activity, usually by 
1 5 binding to or otherwise interacting with or otherwise modifying the esterase's active site. 
Esterase inhibitors can also interact with other regions of the esterase protein to inhibit 
esterase activity, for example, by allosteric interaction. Inhibitors of esterases are 
usually relatively small compounds and as such differ from anti-esterase antibodies. 
Preferably, an esterase inhibitor of the present invention is identified by its ability to 
20 bind to, or otherwise interact with, a flea esterase protein, thereby inhibiting the activity 

of the flea esterase. 

Esterase inhibitors can be used directly as compounds in compositions of the 
present invention to treat animals as long as such compounds are not harmful to host 
animals being treated. Esterase inhibitors can also be used to identify preferred types of 
25 arthropod esterases to target using compositions of the pre,euc invention, for example by 
affinity chromatography. Preferred esterase inhibitors of the present invention include, 
but are not limited to, flea esterase substrate analogs, and other molecules that bind to a 
flea esterase (e.g., to an allosteric site) in such a manner that esterase activity of the flea 
esterase is inhibited; examples include, but are not limited to, juvenile hormone analogs 
30 and cholinesterase inhibitors as well as other neural transmission inhibitors. An esterase 
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substrate analog refers to a compound that interacts with (e.g., binds to, associates with, 
modifies) the active site of an esterase protein. A preferred esterase substrate analog 
inhibits esterase activity. Esterase substrate analogs can be of any inorganic or organic 
composition, and, as such, can be, but are not limited to, peptides, nucleic acids, and 
peptidomimetic compounds. Esterase substrate analogs can be, but need not be, 
structurally similar to an esterase's natural substrate as long as they can interact with the 
active site of that esterase protein. Esterase substrate analogs can be designed using 
computer-generated structures of esterase proteins of the present invention or computer 
structures of esterases' natural substrates. Substrate analogs can also be obtained by 
generating random samples of molecules, such as oligonucleotides, peptides, 
peptidomimetic compounds, or other inorganic or organic molecules, and screening such 
samples by affinity chromatography techniques using the corresponding binding partner, 
(e.g., a flea esterase). A preferred esterase substrate analog is a peptidomimetic 
compound (i.e., a compound that is structurally and/or functionally similar to a natural 
substrate of an esterase of the present invention, particularly to the region of the 
substrate that interacts with the esterase active site, but that inhibits esterase activity 
upon interacting with the esterase active site). 

Esterase peptides, mimetopes and substrate analogs, as well as other protective 
compounds, can be used directly as compounds in compositions of the present invention 
to treat animals as long as such compounds are not harmful to the animals being treated. 

The present invention also includes a therapeutic composition comprising at least 
one arthropod esterase-based compound of the present invention in combination with at 
least one additional compound protective against hematophagous ectoparasite 
infestation. Examples of such compounds are disclosed herein. 

In one embodiment, a therapeutic composition of the present invention c-n '.e 
used to protect an animal from hematophagous ectoparasite infestation by administering 
such composition to a hematophagous ectoparasite, such as to a flea, in order to prevent 
infestation. Such administration could be oral, or by application to the environment 
(e.g., spraying). Examples of such compositions include, but are not limited to, 
) transgenic vectors capable of producing at least one therapeutic composition of the 
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present invention. In another embodiment, a hematophagous ectoparasite, such as a flea, 
can ingest therapeutic compositions, or products thereof, present in the blood of a host 
animal that has been administered a therapeutic composition of the present invention. 

Compositions of the present invention can be administered to any animal 
susceptible to hematophagous ectoparasite infestation (i.e., a host animal), including 
warm-blooded animals. Preferred animals to treat include mammals and birds, with 
cats, dogs, humans, cattle, chinchillas, ferrets, goats, mice, minks, rabbits, raccoons, rats, 
sheep, squirrels, swine, chickens, ostriches, quail and turkeys as well as other furry 
animals, pets, zoo animals, work animals and/or food animals, being more preferred. 
Particularly preferred animals to protect are cats and dogs. 

In accordance with the present invention, a host animal (i.e., an animal that is or 
is capable of being infested with a hematophagous ectoparasite) is treated by 
administering to the animal a therapeutic composition of the present invention in such a 
manner that the composition itself (e.g., an esterase inhibitor, an esterase synthesis 
suppressor (i.e., a compound that decreases the production of esterase in the 
hematophagous ectoparasite), an esterase mimetope, or an anti-esterase antibody) or a 
product generated by the animal in response to administration of the composition (e.g., 
antibodies produced in response to administration of an arthropod esterase protein or 
nucleic acid molecule, or conversion of an inactive inhibitor "prodrug" to an active 
esterase inhibitor) ultimately enters the hematophagous ectoparasite. A host animal is 
preferably treated in such a way that the compound or product thereof enters the blood 
stream of the animal. Hematophagous ectoparasites are then exposed to the composition 
or product when they feed from the animal. For example, flea esterase inhibitors 
administered to an animal are administered in such a way that the inhibitors enter the 
blood stream of the animal, where they can be tak.n up by feeding fleas. In another 
embodiment, when a host animal is administered an arthropod esterase protein or nucleic 
acid molecule, the treated animal mounts an immune response resulting in the 
production of antibodies against the esterase (i.e., anti-esterase antibodies) which 
circulate in the animal's blood stream and are taken up by hematophagous ectoparasites 
) upon feeding. Blood taken up by hematophagous ectoparasites enters the 
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hematophagous ectoparasites where compounds of the present invention, or products 
thereof, such as anti-esterase antibodies, esterase inhibitors, esterase mimetopes and/or 
esterase synthesis suppressors, interact with, and reduce esterase activity in the 
hematophagous ectoparasite. 

The present invention also includes the ability to reduce larval hematophagous 
ectoparasite infestation in that when hematophagous ectoparasites feed from a host 
animal that has been administered a therapeutic composition of the present invention, at 
least a portion of compounds of the present invention, or products thereof, in the blood 
taken up by the hematophagous ectoparasite are excreted by the hematophagous 
ectoparasite in feces, which is subsequently ingested by hematophagous ectoparasite 
larvae. In particular, it is of note that flea larvae obtain most, if not all, of their nutrition 
from flea feces. 

In accordance with the present invention, reducing esterase activity in a 
hematophagous ectoparasite can lead to a number of outcomes that reduce 
i hematophagous ectoparasite burden on treated animals and their surrounding 

environments. Such outcomes include, but are not limited to, (a) reducing the viability 
of hematophagous ectoparasites that feed from the treated animal, (b) reducing the 
fecundity of female hematophagous ectoparasites that feed from the treated animal, (c) 
reducing the reproductive capacity of male hematophagous ectoparasites that feed from 
20 the treated animal, (d) reducing the viability of eggs laid by female hematophagous 

ectoparasites that feed from the treated animal, (e) altering the blood feeding behavior of 
hematophagous ectoparasites that feed from the treated animal (e.g., hematophagous 
ectoparasites take up less volume per feeding or feed less frequently), (f) reducing the 
viability of hematophagous ectoparasite larvae, for example due to the feeding of larvae 
25 from feces of hemator hagous ectoparasites that feed from the treated animal and o" (g) 
altering the development of hematophagous ectoparasite larvae (e.g., by decreasing 
feeding behavior, inhibiting growth, inhibiting (e.g., slowing or blocking) molting, 
and/or otherwise inhibiting maturation to adults). 

Therapeutic compositions of the present invention also include excipients in 
30 which protective compounds are formulated. An excipient can be any material that the 
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animal to be treated can tolerate. Examples of such excipients include water, saline, 
Ringer's solution, dextrose solution, Hank's solution, and other aqueous physiologically 
balanced salt solutions. Nonaqueous vehicles, such as fixed oils, sesame oil, ethyl 
oleate, or triglycerides may also be used. Other useful formulations include suspensions 
containing viscosity enhancing agents, such as sodium carboxymethylcellulose, sorbitol, 
or dextran. Excipients can also contain minor amounts of additives, such as substances 
that enhance isotonicity and chemical stability. Examples of buffers include phosphate 
buffer, bicarbonate buffer and Tris buffer, while examples of preservatives include 
thimerosal or o-cresol, formalin and benzyl alcohol. Standard formulations can either be 
liquid injectables or solids which can be taken up in a suitable liquid as a suspension or 
solution for injection. Thus, in a non-liquid formulation, the excipient can comprise 
dextrose, human serum albumin, dog serum albumin, cat serum albumin, preservatives, 
etc., to which sterile water or saline can be added prior to administration. 

In one embodiment of the present invention, a therapeutic composition can 
include an adjuvant. Adjuvants are agents that are capable of enhancing the immune 
response of an animal to a specific antigen. Suitable adjuvants include, but are not 
limited to, cytokines, chemokines, and compounds that induce the production of 
cytokines and chemokines (e.g., granulocyte macrophage colony stimulating factor 
(GM-CSF), granulocyte colony stimulating factor (G-CSF), macrophage colony 
stimulating factor (M-CSF), colony stimulating factor (CSF), erythropoietin (EPO), 
interleukin 2 (IL-2), interleukin-3 (IL-3), interleukin 4 (IL-4), interleukin 5 (IL-5), 
interleukin 6 (IL-6), interleukin 7 (IL-7), interleukin 8 (IL-8), interleukin 10 (IL-10), 
interleukin 12 (IL-12), interferon gamma, interferon gamma inducing factor I (IGIF), 
transforming growth factor beta, RANTES (regulated upon activation, normal T cell 
expressed and presumably secreted), macrophage inflammatory proteins (e.g., MIP-1 
alpha and MIP-1 beta), and Leishmania elongation initiating factor (LEIF); bacterial 
components (e.g., endotoxins, in particular superantigens, exotoxins and cell wall 
components); aluminum-based salts; calcium-based salts; silica; polynucleotides; 
toxoids; serum proteins, viral coat proteins; block copolymer adjuvants (e.g., Hunter's 
Titermax™ adjuvant (Vaxcel™, Inc. Norcross, GA), Ribi adjuvants (Ribi ImmunoChem 



.52- FC-1-C1-PCT 

Research, Inc., Hamilton, MT); and saponins and their derivatives (e.g., Quil A 
(Superfos Biosector A/S, Denmark). Protein adjuvants of the present invention can be 
delivered in the form of the protein themselves or of nucleic acid molecules encoding 
such proteins using the methods described herein. 
5 In one embodiment of the present invention, a therapeutic composition can 

include a carrier. Carriers include compounds that increase the half-life of a therapeutic 
composition in the treated animal. Suitable carriers include, but are not limited to, 
polymeric controlled release vehicles, biodegradable implants, liposomes, bacteria, 
viruses, other cells, oils, esters, and glycols. 
1 0 One embodiment of the present invention is a controlled release formulation that 

is capable of slowly releasing a composition of the present invention into an animal. As 
used herein, a controlled release formulation comprises a composition of the present 
invention in a controlled release vehicle. Suitable controlled release vehicles include, 
but are not limited to, biocompatible polymers, other polymeric matrices, capsules, 
1 5 microcapsules, microparticles, bolus preparations, osmotic pumps, diffusion devices, 
liposomes, lipospheres, and transdermal delivery systems. Other controlled release 
formulations of the present invention include liquids that, upon administration to an 
animal, form a solid or a gel in situ. Preferred controlled release formulations are 
biodegradable (i.e., bioerodible). 
20 A preferred controlled release formulation of the present invention is capable of 

releasing a composition of the present invention into the blood of an animal at a constant 
rate sufficient to attain therapeutic dose levels of the composition to protect an animal 
from hematophagous ectoparasite infestation. The therapeutic composition is preferably 
released over a period of time ranging from about 1 to about 12 months. A preferred 
25 controlled release formulation of the present invention is capable of effr -* ig a treatment 
preferably for at least about 1 month, more preferably for at least about 3 months, even 
more preferably for at least about 6 months, even more preferably for at least about 9 
months, and even more preferably for at least about 12 months. 

Acceptable protocols to administer therapeutic compositions of the present 
30 invention in an effective manner include individual dose size, number of doses, 
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frequency of dose administration, and mode of administration. Determination of such 
protocols can be accomplished by those skilled in the art. A suitable single dose is a 
dose that is capable of protecting an animal from disease when administered one or more 
times over a suitable time period. For example, a preferred single dose of a protein, 
5 mimetope or antibody therapeutic composition is from about 1 microgram (ug) to about 
10 milligrams (mg) of the therapeutic composition per kilogram body weight of the 
animal. Booster vaccinations can be administered from about 2 weeks to several years 
after the original administration. Booster administrations preferably are administered 
when the immune response of the animal becomes insufficient to protect the animal 
10 from disease. A preferred administration schedule is one in which from about 10 ug to 
about 1 mg of the therapeutic composition per kg body weight of the animal is 
administered from about one to about two times over a time period of from about 2 
weeks to about 12 months. Modes of administration can include, but are not limited to, 
subcutaneous, intradermal, intravenous, intranasal, oral, transdermal, intraocular and 
15 intramuscular routes. 

According to one embodiment, a nucleic acid molecule of the present invention 
can be administered to an animal in a fashion to enable expression of that nucleic acid 
molecule into a protective protein or protective RNA (e.g., antisense RNA, ribozyme, 
triple helix forms or RNA drug) in the animal. Nucleic acid molecules can be delivered 
20 to an animal in a variety of methods including, but not limited to, (a) administering a 

naked (i.e., not packaged in a viral coat or cellular membrane) nucleic acid vaccine (e.g., 
as naked DNA or RNA molecules, such as is taught, for example in Wolff et al., 1990, 
Science 247, 1465-1468) or (b) administering a nucleic acid molecule packaged as a 
recombinant virus vaccine or as a recombinant cell vaccine (i.e., the nucleic acid 
25 molecule is delivered by a viral or cellular -\ ehicle). 

A naked nucleic acid vaccine of the present invention includes a nucleic acid 
molecule of the present invention and preferably includes a recombinant molecule of the 
present invention that preferably is replication, or otherwise amplification,- competent. A 
naked nucleic acid vaccine of the present invention can comprise one or more nucleic 
30 acid molecules of the present invention in the form of, for example, a bicistronic 
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recombinant molecule having, for example one or more internal ribosome entry sites. 
Preferred naked nucleic acid vaccines include at least a portion of a viral genome (i.e., a 
viral vector). Preferred viral vectors include those based on alphaviruses, poxviruses, 
adenoviruses, herpesviruses, and retroviruses, with those based on alphaviruses (such as 
5 Sindbis or Semliki virus), species-specific herpesviruses and species-specific poxviruses 
being particularly preferred. Any suitable transcription control sequence can be used, 
including those disclosed as suitable for protein production. Particularly preferred 
transcription control sequence include cytomegalovirus intermediate early (preferably in 
conjunction with Intron-A), Rous Sarcoma Virus long terminal repeat, and tissue- 
10 specific transcription control sequences, as well as transcription control sequences 
endogenous to viral vectors if viral vectors are used. The incorporation of "strong" 
poly(A) sequences are also preferred. 

Naked nucleic acid vaccines of the present invention can be administered in a 
variety of ways, with intramuscular, subcutaneous, intradermal, transdermal, intranasal 
15 and oral routes of administration being preferred. A preferred single dose of a naked 

nucleic acid vaccines ranges from about 1 nanogram (ng) to about 100 ug, depending on 
the route of administration and/or method of delivery, as can be determined by those 
skilled in the art. Suitable delivery methods include, for example, by injection, as drops, 
aerosolized and/or topically. Naked DNA of the present invention can be contained in 
20 an aqueous excipient (e.g., phosphate buffered saline) alone or a carrier (e.g., lipid-based 
vehicles). 

A recombinant virus vaccine of the present invention includes a recombinant 
molecule of the present invention that is packaged in a viral coat and that can be 
expressed in an animal after administration. Preferably, the recombinant molecule is 
25 packaging ;ficient and/or encodes an attenuated virus. A number of recombinant 
viruses can be used, including, but not limited to, those based on alphaviruses, 
poxviruses, adenoviruses, herpesviruses, and retroviruses. Preferred recombinant virus 
vaccines are those based on alphaviruses (such as Sindbis virus), raccoon poxvirus*. 
. species-specific herpesviruses and species-specific poxviruses. An example of methods 
30 to produce and use alphavirus recombinant virus vaccines is disclosed in PCT 
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Publication No. WO 94/17813, by Xiong et al., published August 18, 1994, which is 
incorporated by reference herein in its entirety. 

When administered to an animal, a recombinant virus vaccine of the present 
invention infects cells within the immunized animal and directs the production of a 

5 protective protein or RNA nucleic acid molecule that is capable of protecting the animal 
from hematophagous ectoparasite infestation. For example, a recombinant virus vaccine 
comprising an arthropod CE nucleic acid molecule of the present invention is 
administered according to a protocol that results in the animal producing a sufficient 
immune response to protect itself from hematophagous ectoparasite infestation. A 

1 0 preferred single dose of a recombinant virus vaccine of the present invention is from 

about 1 x 10 4 to about 1 x 10 7 virus plaque forming units (pfu) per kilogram body weight 
of the animal. Administration protocols are similar to those described herein for protein- 
based vaccines, with subcutaneous, intramuscular, intranasal and oral administration 
routes being preferred. 

1 5 A recombinant cell vaccine of the present invention includes recombinant cells 

of the present invention that express at least one protein of the present invention. 
Preferred recombinant cells for this embodiment include Salmonella, E. coli, Listeria, 
Mycobacterium, S. frugiperda, yeast, (including Saccharomyces cerevisiae), BHK, CV- 
1 , myoblast G8, COS (e.g., COS-7), Vero, MDCK and CRFK recombinant cells. 
20 Recombinant cell vaccines of the present invention can be administered in a variety of 
ways but have the advantage that they can be administered orally, preferably at doses 
ranging from about 10 8 to about 10 12 cells per kilogram body weight. Administration 
protocols are similar to those described herein for protein-based vaccines. Recombinant 
cell vaccines can comprise whole cells, cells stripped of cell walls or cell lysates. 
25 The efficacy of a therapeutic composition of the present invention to protect an 

animal from hematophagous ectoparasite infestation can be tested in a variety of ways 
including, but not limited to, detection of anti-arthropod esterase antibodies (using, for 
example, proteins or mimetopes of the present invention), detection of cellular immunity 
within the treated animal, or challenge of the treated animal with hematophagous 
30 ectoparasites to determine whether, for example, the feeding, fecundity or viability of 
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hematophagous ectoparasites feeding from the treated animal is disrupted. Challenge 
studies can include attachment of chambers containing hematophagous ectoparasites 
onto the skin of the treated animal. In one embodiment, therapeutic compositions can be 
tested in animal models such as mice. Such techniques are known to those skilled in the 
5 art. 

One preferred embodiment of the present invention is the use of arthropod 
protective compounds, such as proteins, mimetopes, nucleic acid molecules, antibodies 
and inhibitory compounds of the present invention, to protect an animal from 
hematophagous ectoparasite, and particularly flea, infestation. Preferred protective 
1 0 compounds of the present invention include, but are not limited to, C. felis esterase 
nucleic acid molecules, C. felis esterase proteins and mimetopes thereof, anti-C. felis 
esterase antibodies, and inhibitors of C. felis esterase activity. More preferred protective 
compounds of the present invention include, but are not limited to, CE or JHE 
formulations of the present invention, C. felis CE nucleic acid molecules, C. felis CE 
15 proteins and mimetopes thereof, anti-flea CE antibodies, anti-flea JHE antibodies, 
inhibitors of C. felis CE activity and inhibitors of flea JHE activity. Additional 
protection may be obtained by administering additional protective compounds, including 
other proteins, mimetopes, nucleic acid molecules, antibodies and inhibitory compounds, 
as disclosed herein. 

20 One therapeutic composition of the present invention includes an inhibitor of 

arthropod esterase activity, i.e., a compound capable of substantially interfering with the 
function of an arthropod esterase susceptible to inhibition by an inhibitor of arthropod 
esterase activity. An inhibitor of esterase activity can be identified using arthropod 
esterase proteins of the present invention. One embodiment of the present invention is a 
25 method to identify a compound capable of inhibiting esterase r <* vity of an rrthropod. 
Such a method includes the steps of (a) contacting (e.g., combining, mixing) an isolated 
flea esterase protein, preferably a C. felis esterase protein of the present invention, with a 
putative inhibitory compound under conditions in which, in the absence of the 
compound, the protein has esterase activity, and (b) determining if the putative 
30 inhibitory compound inhibits the esterase activity. Putative inhibitory compounds to 
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screen include small organic molecules, antibodies (including mimetopes thereof) and 
substrate analogs. Methods to determine esterase activity are known to those skilled in 
the art; see, for example, the Examples section of the present application. 

The present invention also includes a test kit to identify a compound capable of 
5 inhibiting esterase activity of an arthropod. Such a test kit includes an isolated flea 
esterase protein, preferably a C. felis esterase protein, having esterase activity and a 
means for determining the extent of inhibition of esterase activity in the presence of (i.e., 
effected by) a putative inhibitory compound. Such compounds are also screened to 
identify those that are substantially not toxic in host animals. 
10 Esterase inhibitors isolated by such a method, and/or test kit, can be used to 

inhibit any esterase that is susceptible to such an inhibitor. Preferred esterase proteins to 
inhibit are those produced by arthropods. A particularly preferred esterase inhibitor of 
the present invention is capable of protecting an animal from hematophagous 
ectoparasite infestation. Effective amounts and dosing regimens can be determined 
1 5 using techniques known to those skilled in the art. 

° The following examples are provided for the purposes of illustration and are not 
intended to limit the scope of the present invention. 

EXAMPLES 

It is to be noted that the Examples include a number of molecular biology, 
20 microbiology, immunology and biochemistry techniques considered to be known to 
those skilled in the art. Disclosure of such techniques can be found, for example, in 
Sambrook et al., ibid., Borovsky, Arch Insect Biochem. and Phys., 7:187-210, 1988, and 
related references. 
Example 1 

25 This example describes '* ding of proteases and esterases with radiolabeled 

di i sopropyl fluorophosphate. 

Tissue samples were isolated from unfed or bovine blood-fed 1 st instar 
Ctenocephalides felis flea larvae; bovine blood-fed or cat blood-fed 3rd instar 
Ctenocephalides felis flea larvae; bovine blood-fed or cat blood-fed Ctenocephalides 
30 felis prepupal flea larvae; bovine blood-fed or cat blood-fed adult Ctenocephalides felis 
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flea midgut tissue, and whole unfed, bovine blood-fed or cat blood-fed adult 
Ctenocephalidesfelis fleas. The 1st instar, 3rd instar, prepupal and adult midgut tissues 
were then homogenized by freeze-fracture and sonicated in a Tris buffer comprising 50 
mM Tris, pH 8.0 and 100 mM CaCl 2 . The whole adult flea sample was then 
homogenized by freeze-fracture and ground with a microtube mortar and pestle. The 
extracts were centrifuged at about 14,000 x g for 20 minutes (min.) and the soluble 
material recovered. The soluble material was then diluted to a final concentration of 
about 1 to about 1.2 tissue equivalents per microliter (pi) of Tris buffer. Each sample 
was labeled with [l,3- 3 H]-diisopropylfluorophosphate ( 3 H-DFP) (available from 
DuPont-NEN, Wilmington, DE) using the method generally described in Borovsky, ibid. 
About 20 tissue equivalents of each tissue sample were mixed with about 1 pCi of 3 H- 
DFP and incubated for about 18 hours at 4°C. Proteins contained in each sample were 
then resolved using a 14% Tris-glycine sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE) (available from Novex, San Diego, CA) under reducing 
conditions. The gel was soaked in Entensify (available from DuPont-NEN) according to 
manufacturers instructions, and exposed to X-ray film (available from Kodak X-Omat 
AR, Rochester, NY) for about 3 days at -70°C. 

Analysis of the resulting autoradiogram (shown in Fig. 1) indicated that tissue 
samples from 3rd instar, prepupal larvae and whole adult flea contained proteins that 
labeled with DFP, having a molecular weight (MW) of about 60 kilodalton (kD). No 
proteins of this MW were labeled in tissue samples from unfed or fed 1st instar larvae 
and adult midgut. The results indicated preferred tissue distribution and stage-specific 
expression of DFP-labeled serine esterases in fleas. 
Example 2 

Thio e cample describes the identification of general CE activity in flea tissue 
extracts. 

Tissue samples and soluble extracts were prepared as described above in 
Example 1 , except not labelled, from unfed (UF) and bovine blood-fed 1 st instar flea 
larvae, bovine blood-fed 3rd instar flea larvae, bovine blood-fed prepupal flea larvae, 
) unfed whole adult fleas, cat blood-fed adult (ACF) whole fleas, cat blood-fed adult fleas 
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10 



that have had their heads and midguts removed (referred to herein as fed adult partial 
fleas), unfed adult flea midguts and cat blood-fed adult flea midguts. About 5 tissue 
equivalents of each tissue were assayed for general CE activity using the following 
method. Tissue samples of about 5 pi were added to separate wells of flat-bottomed 
microliter plate (available from Becton Dickinson, Lincoln Park, NJ). A control well 
was prepared by adding about 5 pi of Tris buffer to an empty well of the plate. About 
95 pi of 25 mM Tris-HCl (pH 8.0) was then added to each sample to increase the 
volume in each well to about 100 pi. About 100 pi of 0.25 mM cc-napthyl acetate 
(available from Sigma, St. Louis, MO) dissolved in 25 mM Tris-HCl (pH 8.0) was then 
added to each well. The plate was then incubated for about 15 min. at 37°C. Following 
the incubation, about 40 pi of 0.3% Fast Blue salt BN (tetrazotized o-dianisidine; 
available from Sigma) dissolved in 3.3% SDS in water was added to each well. 

The microtiter plate was then analyzed using a Cambridge Technology, Inc. 
(Watertown, PA) model 7500 Microplate Reader set to 590 nm. The absorbance value 
for the control sample was subtracted from absorbance values of experimental samples, 
such that the background value was zero. 

The results shown in Fig. 2 indicated that general CE activity was detected in all 
tissue samples. The level of activity varied, with unfed and fed 1st instar larvae, unfed 
adult flea midguts, and fed adult flea midguts having relatively lower activity than in the 
20 other tissues. Thus, the results indicated preferred tissue distribution and stage-specific 
expression of general CE activity in fleas. 
Example 3 

This example describes the determination of general CE activity using isoelectric 
focusing (IEF)-PAGE and non-reducing SDS-PAGE. 
25 A. Non-reducing SDS-PAGE. 

Soluble extracts from unfed and bovine blood-fed 1st instar flea larvae, bovine 
blood-fed 3rd instar flea larvae, bovine blood-fed prepupal flea larvae, bovine blood-fed 
adult (ABF) whole fleas and cat blood-fed adult whole fleas were prepared using the 
method described in Example 1. Each soluble extract sample was combined with SDS 
30 sample buffer (available from Novex) and proteins in the samples were resolved by gel 
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electrophoresis using 14% Tris-glycine SDS electrophoresis gels (available from 
Novex). The gels were run at room temperature for about 1 hour at 200 volts. After 
electrophoresis, the gels were soaked for about for 30 minutes in 50 mM Tris, pH 8.0, 
containing 2.5% Triton X-100 to renature the proteins. The gels were then soaked in 50 
5 mM Tris, pH 8.0, for about 5 minutes and then stained for about 5 min. in 50 milliliters 
(ml) of 25 mM Tris, pH 8.0, containing 50 mg Fast blue salt BN and 10 mg a-napthyl 
acetate (dissolved in 1 ml acetone). Once protein was detected on the stained gels, the 
gels were rinsed with water and photographed. 
B. IEF-PAGE. 

1 0 Soluble extracts from unfed and bovine blood-fed 1 st instar flea larvae, bovine 

blood-fed 3rd instar flea larvae, bovine blood-fed prepupal flea larvae, unfed and cat 
blood-fed whole fleas, cat blood-fed adult partial fleas and cat blood-fed adult midguts 
were prepared as described above in Section A. The extracts were each combined with 
IEF sample buffer pH 3-7 (available from Novex) and loaded onto pH 3-7 IEF 

1 5 electrophoresis gels (available from Novex). The gels were electrophoresed at room 
temperature first for about 1 hour at about 100 volts, then for about 1 hour at about 200 
volts, and then for about 30 min. at about 500 volts. Following electrophoresis, the gels 
were soaked in 25 mM Tris buffer, pH 8.0, for about 5 min. and then stained for about 1- 
5 min. in 50 ml of 25 mM Tris buffer, pH 8.0, containing 50 mg Fast blue salt BN and 

20 1 0 mg a-napthyl acetate (dissolved in 1 ml acetone). Once protein was detected on the 
stained gels, the gels were rinsed with water and photographed. 
C. Results. 

The results from gel electrophoresis experiments described above in Sections A 
and B are shown in Figs. 3 and 4. The results indicated that certain flea tissues contain 

25 proteins having MW's of from about 60 to about 70 k^ ' nd nativt pi values of from 
about 4.7 to about 5.2 that have CE activity. In particular, CE activity was identified in 
prepupal larvae and fed adult flea extracts resolved by non-reduced SDS-PAGE. No CE 
activity was identified in unfed and fed 1st instar larvae or fed 3rd instar larvae extracts 
(see Fig. 3). When extracts were resolved by native IEF-PAGE, CE activity was 

30 identified in fed 3rd instar larvae, prepupal larvae, unfed and fed whole adult flea, and 
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fed adult partial flea extracts (see Fig. 4, lanes 3-7)). No CE activity was identified in 
unfed or fed 1st instar larvae, or in fed adult flea midgut extracts (see Fig. 4, lanes 1, 2, 
and 8). 
Example 4 

5 This example describes the purification of CE protein from prepupal flea larvae. 

About 15,000 bovine blood-fed prepupal flea larvae were collected and the 
larvae were homogenized in TBS by sonication in 50 ml Oak Ridge centrifuge tubes 
(available from Nalgene Co., Rochester, NY) by sonicating 4 times 20 seconds each at a 
setting of 5 of a model W-380 Sonicator (available from Heat Systems-Ultrasonics, 

10 Inc.). The sonicates were clarified by centrifugation at 18,000 RPM for 30 minutes to 
produce an extract. Soluble protein in the extract was removed by aspiration and diluted 
to a volume of about 20 ml in TBS (equivalent to about 1 larva per \x\ TBS). The extract 
was then added to a column containing about 5 ml of p-aminobenzamidine linked to 
agarose beads (available from Sigma, St. Louis, MO) and incubated overnight at 4°C. 

15 The column was then washed with about 30 ml TBS to remove unbound protein. The 
collected unbound protein was then concentrated to a volume of about 20 ml using a 
Macrosep 10 centrifugal protein concentrator (Filtron Technology Corp., Northborough, 
MA) and filtered sequentially through a 1 \xm syringe filter and then through a 0.2 jim 
syringe filter to clarify the sample for chromatography. 

20 Aliquots of about 0.5 ml were loaded onto a 20 ml Superdex 200 HR gel 

filtration column (available from Pharmacia, Piscataway, NJ) equilibrated in TBS, 
operated on a BioLogic liquid chromatography system (available from BioRad, 
Burlingame, CA). About 1 ml fractions were then collected. Repetitive runs were 
performed until about 30 ml of each fraction was collected. The fractions were analyzed 

25 for CE activity using th° a'.say desc.ibed above in Example 2. In preparation for cation 
exchange chromatography, fractions having CE activity (V e =16-18 ml) were combined 
and dialyzed against about 2 liters of 20 mM MES buffer (2-(N- 

morpholino)ethanesulfonic acid), pH 6.0, containing 10 mM NaCl, for about 1.5 hours, 
and then against about 1 liter of the same buffer overnight at 4°C. Prior to loading onto 
30 the cation exchange chromatography column, the sample was again filtered through a 
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0.2 |im syringe filter to remove precipitated proteins. The sample was then applied to a 
Bio-Scale S2 cation exchange column (available from BioRad) at a rate of about 0.5 
ml/min. The column was washed with MES buffer until all unbound protein was 
removed. Protein bound to the column was then eluted with a linear gradient from 10 
m M to 1 M NaCl in 20 mM MES buffer, pH 6. Fractions were assayed for CE activity 
using the assay described above in Example 2. The results indicated that CE activity 
was not retained on the cation exchange column using the above conditions, and all of 
the activity was found in the flow-through fractions. 

Fractions containing CE activity were pooled and adjusted to pH 7 using 0.5 M 
Tris, pH 8.0, in preparation for anion exchange chromatography. The pooled fractions 
were then loaded onto a 4.5 mm x 50 mm Poros 10 HQ anion exchange chromatography 
column (available from PerSeptive Biosystems, Cambridge, MA) equilibrated in 25 
mM Tris buffer, pH 6.8. The column was washed with the loading buffer, and bound 
proteins were eluted with a linear gradient of 0 to 1 M NaCl in 25 mM Tris buffer, pH 
6.8. Fractions were tested for CE activity using the assay described above in Example 2. 
The results indicated that CE activity was eluted at about 170 mM NaCl. Fractions 
containing CE activity were pooled and diafiltered into TBS. 
Example 5 

This example describes the determination of N-terminal amino acid sequences of 
carboxylesterases isolated from prepupal flea larvae. 

A. Anion exchange chromatography fractions. 

Anion exchange chromatography fractions described above in Example 4 that 
contained proteins having CE activity were pooled, diafiltered into TBS buffer and 
concentrated 3-fold in a Speed-Vac Concentrator (available from Savant Instruments, 
Hoir.ook, NY). Proteins in the concentrated samples were then resolved on a reducing, 
10% SDS-PAGE Tris-glycine gel (available from Novex) for 1 hour at about 200 V. 
The proteins on the gel were then blotted onto a polyvinylidene difluoride (PVDF) 
membrane (available from Novex) for about 70 min in 10 mM CAPS buffer (3- 
[cyclohexylamino]-l-propanesulfonic acid; available from Sigma), pH 11, with 0.5 mM 
dithiothreitol (DTT). The membrane was then stained for 1 minute in 0.1% Coomassie 
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Blue R-250 dissolved in 40% methanol and 1% acetic acid. The membrane was 
destained in 50% methanol for about 10 minutes, rinsed with MilliQ water and air dried. 
Three stained protein bands were identified having apparent molecular weights of about 
64 kD, 65 kD, and 66 kD, respectively. The portion of the membrane containing each 
band was excised separately. Protein contained in each membrane segment was 
subjected to N-terminal amino acid sequencing using a 473A Protein Sequencer 
(available from Applied Biosystems, Foster City, CA) and using standard techniques. 

The results indicated that the N-terminal amino acid sequence of the putative 64 
kD protein was DPPTVTLPQGEL (denoted SEQ ID NO:39); the N-terminal amino acid 
sequence of the putative 65 kD protein was DPPT VTLP QGEL VGKATNEnxk (denoted 
SEQ ID NO:40); and the N-terminal amino acid sequence of the putative 66 kD protein 
was DppTVTLPQGEL (denoted SEQ ID NO:41), in which the lower case letters 
designate uncertainties and "x" designates an undetermined residue. 
B . Proteins Resolved by Native IEF-P AGE. 
1 5 Proteins isolated by anion exchange chromatography as described above in 

Section A were further resolved by native IEF-P AGE. Proteins were loaded onto a pH 
3-10 IEF gel (available from Novex) and separated in Novex's IEF buffers according to 
Novex's standard procedure (60 min at 100 V; then 60 min at 200 V; and then 30 min at 
500 V). Following electrophoresis, part of the gel was stained for CE activity using the 
20 method described above in Example 2. The remaining portion of the gel was blotted 
onto PVDF membrane by reversing the orientation of the gel and membrane so that 
positively charged proteins migrated to the membrane, electrophoresing the protein for 
60 min at 10 V, using 0.7% acetic acid as the transfer buffer. The membrane was 
stained as described above in Section A. After the membrane was dried, stained protein 
25 bands on the membrane were compared to bands on the gel tested for CE ac'i v : cy to 

identify corresponding bands. Protein bands on the membrane corresponding to proteins 
having CE activity were excised and submitted to N-lerminal sequencing as described in 
Section A. 

N-terminal amino acid sequence was obtained for protein contained in two bands 
30 having pi values of about pi 4.8 and about pi 4.9. N-terminal amino acid sequence of 
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the pi 4.8 band was DPPTVTLPQGELVGKALSNen (denoted SEQ ID NO:42) and N- 
terminal amino acid sequence of the pi 4.9 band was DPPTVTLP (denoted SEQ ID 
NO:43). A comparison of the N-terminal amino acid sequences identified here and 
described in Section A indicates closely related proteins having a consensus sequence of 
5 DPPTVTLPQGELVGKALTNEnGk (denoted SEQ ID NO:44). 

The amino acid sequences of SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, 
SEQ ID NO:42, SEQ ID NO:43 and SEQ ID NO:44 are substantially contained within 
SEQ ID NO:5, SEQ ID NO: 19 and SEQ ID NO:53, which are described below in 
Example 1 1 . 
10 Example 6 

This example describes partial purification of CE from 3rd instar flea larvae. 
Using the extract preparation methods described in Example 1 without labelling, 
extracts were prepared from about 50,000 bovine blood-fed 3rd instar flea larvae. The 
extract was then further purified over a p-aminobenzamidine linked agarose bead 
1 5 column using the method also described in Example 1. Collected unbound protein was 
concentrated to about 70 ml using a 200 ml stirred cell fitted with a YM-10 membrane 
(available from Amicon, Beverly, MA). Seven ml (about 5,000 3rd instar flea larval 
equivalents) of the concentrated extract was used for the remainder of the purification 
scheme described in Example 4. Resulting fractions from the anion exchange 
20 chromatography column were tested for CE activity using the assay described above in 
Example 2. 

The results indicated that CE activity was eluted in two overlapping peaks at 
about 120 mM and about 210 mM NaCl. 
Example 7 

25 This example describes the identifica^o- of JHE activity in different flea tissues. 

Tissue samples were prepared as described above in Example 1 from unfed and 
bovine blood-fed 1st instar flea larvae, bovine blood-fed 3rd instar flea larvae, bovine 
blood-fed prepupal flea larvae, unfed and cat blood-fed whole adult fleas, cat blood-fed 
adult partial fleas and cat blood-fed adult flea midguts. About 5 tissue equivalents of 

30 each tissue was assayed for JHE activity as follows. 
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Unlabeled juvenile hormone (JH; available from ICN Biomedicals, Inc., Aurora, 
OH) was diluted in hexane to concentration of about 0.025 M. Labeled 10- 3 H-juvenile 
hormone ( 3 H-JH; available from Dupont-NEN) was diluted in hexane to concentration of 
about 80,000 cpm/ul. A JH substrate mixture was prepared by mixing about 20 pi of 
5 unlabeled JH with about 80 ul of 3 H-JH (about 5 pCi) in a 4 ml screw cap vial. The 
substrate mixture was then covered with nitrogen (i.e., "blanketed") and the solvent 
contained in the mixture was evaporated by heating the mixture at 35oC. When just dry, 
about 1 ml of absolute anhydrous ethanol (final concentration 5 x 10 4 M, or 6400 
cpm/ul) was added to the vial. The substrate mixture was then stored at -20°C. 
1 0 About 5 equivalents of each tissue (about 5 pi of protein) was added into the 

bottom of a small glass autosampler vial. About 95 pi of Tris-buffered saline (TBS) was 
added to each vial to bring the final volume in each vial to about 100 pi. Two control 
samples were also prepared by adding 100 pi TBS to two separate vials. About 1 pi of 
the substrate mixture described above was added to all of the vials including the control 
15 samples. The final JH concentration in each vial was about 5 x 10" 6 M. The vials were 
then capped and spun in a microfuge to bring all of the liquid to the bottom of each vial. 
The vials were then transferred to a heat block and incubated at 35°C for about 30 
minutes. Following the incubation, enzyme activity was stopped by adding about 50 pi 
of methanol buffer (methanokwaterxoncentrated ammonium hydroxide at a 10:9:1 ratio, 
20 respectively) to each vial and removing the vials from the heat block. 

To measure labeled juvenile hormone acid, about 250 pi isooctane was added to 
each vial. Each vial was vortexed for about 15 seconds or until an emulsion formed. 
Each vial was then centrifuged in a microfuge for about 1 minute to separate aqueous 
and organic phases. About 75 pi of the aqueous layer was removed from each vial and 
added to abou* 2 nl Eco-lume scintillation fluid (available from ICN). The amount of 
>H-juvenile hormone acid contained in each vial was determined using a Beckman LS- 
1801 liquid scintillation counter (available from Beckman, Fullerton, CA). 

The results shown in Fig.5 indicated that all flea tissues tested contain active 
JHE. Referring to Example 2, the level of CE activity differed from JHE activity in 



25 
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various tissue samples. The combined JHE and CE data indicated the differential 
expression of these two enzymatic activities during the development of a flea. 
Example 8 

This example describes the purification of JHE protein from cat blood-fed adult 
5 midguts. 

About 23,000 cat blood-fed adult midguts were collected and prepared using the 
method described in Example 1. The extract was then added in 4 aliquots to columns 
containing about 3 to about 5 ml of p-aminobenzamidine linked agarose beads (available 
from Sigma), equilibrated in 50 mM Tris (pH 8.0), 100 mM CaCl 2 , 400 mM NaCl, and 
10 incubated overnight at 4°C. The columns were then washed with about 15 to about 125 
ml of the equilibration Tris buffer to remove unbound protein. The collected unbound 
protein was pooled and then concentrated to a volume of about 5 ml using an Ultrafree- 
20 10 kD centrifugal concentrator (available from Millipore, Bedford, MA) and filtered 
sequentially through a 0.2 ^im centrifugal ultrafiltration membrane (available from Lida, 
15 Kenosha, WI) to clarify the sample for chromatography. 

Aliquots of about 0.5 ml were loaded onto a Superdex 200 HR gel filtration 
column using the method described in Example 4. Repeated runs were performed until 
about 10 ml of each fraction was collected. The fractions were analyzed for JHE activity 
using the assay described in Example 7. In preparation for anion exchange 
20 chromatography, fractions having JHE activity (V c =l 7-1 8 ml) were combined and 

dialyzed overnight against about 1 L of 20 mM Tris buffer, pH 8.0, containing 10 mM 
NaCl. The sample was then loaded onto a Poros 10 HQ anion exchange column using 
the method described in Example 4. Resulting fractions were tested for JHE activity as 
described in Example 7. 
If The results indicated that midgut JHE activity was eluted from the anion 

exchange column in a single peak at about 120 mM NaCl. 
Example 9 

This example describes partial purification of JHE from prepupal flea larvae and 
3rd instar larvae. 
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A. JHE Purification from Prepupal Tissue. 

Using the extract preparation methods described in Example 1, gel filtration 
fractions were obtained using a Superdex 200 HR gel filtration column (available from 
Pharmacia) using the method described in Example 4, from about 15,000 bovine blood- 
fed prepupal flea larvae. The fractions were analyzed for JHE activity using the assay 
described above in Example 7. Those fractions containing protein having JHE activity 
(V e =16-18 ml) were combined and dialyzed using the method described in Example 8. 

The fractions were then further purified by passing the fractions over a Bio-Scale 
S2 cation exchange column (available from BioRad) at a rate of about 0.5 ml/min. The 
1 0 column was washed with MES until all unbound protein was eluted. Bound protein was 
then eluted with a linear gradient of 20 mM MES buffer, pH 6.0, containing 10 mM 
NaCl to 1 M NaCl. Resulting fractions were assayed for JHE activity using the method 
described in Example 7. The results indicated that proteins having JHE activity using 
prepupal tissue eluted from the column in about 200 to 300 mM NaCl. 
1 5 The fractions containing JHE activity were combined and the pH adjusted to pH - 

7 using 0.5 M Tris buffer (pH 8.0). The fractions were then dialyzed twice against about 
1 liter of 10 mM phosphate buffer (pH 7.2) containing 10 mM NaCl at 4°C. The 
resulting dialyzed fractions were then loaded onto a Bio-Scale CHT2-1 Hydroxyapatite 
Column (available from BioRad) at a rate of about 0.5 ml/min. Unbound protein was 
20 washed from the column using the dialysis buffer. Bound protein was then eluted with a 
linear gradient of from 10 mM phosphate buffer, pH 7.2, containing 10 mM NaCl to 0.5 
M phosphate buffer pH 6.5 containing 10 mM NaCl. One ml fractions were collected 
and each tested for JHE activity by the method described in Example 7. 

The results indicated that JHE eluted in 2 overlapping peaks at about 100 mM 
25 and 1 50 mM phosphate. These two JHE activities were designated PF JHE i aad PP 

JHE II, and were kept separate for the remainder of the purification. Both JHE samples, 
were dialyzed overnight against 20 mM Tris buffer (pH 8.0) containing 10 mM NaCl. 
The two samples were then loaded, separately, onto a 4.5 mm x 50 mm Poros 10 HQ 
anion exchange chromatography column (available from PerSeptive Biosystems) 
30 equilibrated with 20 mM Tris buffer, pH 8.0, containing 10 mM NaCl. Unbound 
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proteins were washed from the column using the same buffer. Bound proteins were 
eluted with a linear gradient of from 10 mM to 1 M NaCl in 20 mM Tris buffer, pH 8.0. 
Resulting fractions were tested for JHE activity using the method described in Example 
7. 

5 The results indicated that in both samples, JHE activity was eluted from the 

column in a single peak at about 100 mM NaCl. 

B. JHE Purification from 3 rd Instar Tissue 

Using the procedure described above in Section A, proteins having JHE activity 
were obtained using about 5,000 bovine blood-fed 3 rd instar flea larvae. Following 
10 purification by cation exchange, proteins having JHE activity using 3 rd instar tissue were 
found to elute in 2 peaks. The first peak having JHE activity was not retained on the 
column and also exhibited CE activity (referred to herein as CE/JHE fractions). The 
second peak having JHE activity eluted from the column in about 100-200 mM NaCl 
and did not contain CE activity. 
15 The CE/JHE fractions were pooled and adjusted to about pH 7 using 0.5 M Tris, 

pH 8.0. The fractions were then loaded onto a 4.5 mm x 50 mm Poros 10 HQ anion 
exchange chromatography column (available from PerSeptive Biosystems) and the 
column was equilibrated in 25 mM Tris buffer, pH 6.8. The column was washed with 
the same buffer and bound proteins were eluted with a linear gradient of 0 to 1 M NaCl 
20 in 25 mM Tris buffer, pH 6.8. Fractions were then tested for JHE activity using the 
method described in Example 7. JHE activity was eluted in two overlapping peaks at 
about 120 mM and 210 mM NaCl. The fraction containing JHE activity also contained 
CE activity when tested using the method described in Example 2. 

Fractions from the cation exchange column containing only JHE activity were 
25 combined, diluted in 20 mM Tris bufe pH 8.0 containing 10 mM NaCl, and 
concentrated to about 5 ml. The fractions were purified on a Poros 10 HQ anion 
exchange chromatography column as described immediately above. Fractions were then 
tested for JHE activity using the method described in Example 7. The JHE activity was 
eluted in a single peak at about 120 mM. The peak contained no detectable CE activity. 
30 Example 10 
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This example describes the purification of JHE protein from unfed adult midguts. 
About 16,000 unfed adult midguts were collected in 20 mM Tris buffer (pH 7.7), 
containing 130 mM NaCl, 1 mM sodium EDTA, 1 mM Pefabloc® (available from 
Boehringer Mannheim, Indianapolis, IN), 1 microgram/ml (ug/ml) leupeptin and 1 
ug/ml pepstatin. The midguts were homogenized by freeze-fracture and sonication, and 
then centrifuged at about 14,000 x g for 20 min. The soluble material from the 
centrifugation step was recovered. The soluble material was then concentrated to about 
1 ml using an Ultrafree-20 10 kD centrifugal concentrator (available from Millipore) and 
filtered sequentially through a 0.2 urn centrifugal ultrafiltration membrane to clarify the 
sample for chromatography. Aliquots of about 0.5 ml were loaded onto a Superdex 200 
HR gel filtration column using the method described in Example 4. Repeated column 
runs were performed until about 2 ml of each fraction was collected. The fractions were 
analyzed for JHE activity using the assay described in Example 7. In preparation for 
cation exchange chromatography, fractions having JHE activity (V e =15-17 ml) were 
combined and dialyzed overnight against about 1 L of 20 mM MES buffer, pH 6.0, 
containing 10 mM NaCl. The sample was then applied to a Bio-Scale S2 cation 
exchange column using the method described in Example 4. Fractions of eluate were 
assayed for JHE activity using the method described in Example 7. 

The results indicate that JHE is present in unfed midguts in two forms, one that is 
not retained on the cation exchange column and one that is bound to the column under 
low salt conditions at about 100 mM NaCl. The form that was not retained under low 
salt conditions was shown to have general CE activity using the methods described in 
Example 2. 
Example 1 1 

This exar-.ple describes the identification of certain esterase nucleic acid 
molecules of the present invention. 

Several flea esterase nucleic acid molecules, representing one or more partial flea 
esterase genes, were PCR amplified from a flea mixed instar cDNA library or a flea 
prepupal cDNA library. The flea mixed instar cDNA library was produced using unfed 
1st instar, bovine blood-fed 1st instar, bovine blood-fed 2 nd instar and bovine blood-fed 
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3 rd instar flea larvae (this combination of tissues is referred to herein as mixed instar 
larval tissues for purposes of this example). The flea prepupal cDNA library was 
produced using prepupal flea larvae. For each library, total RNA was extracted from 
mixed instar or prepupal tissue, respectfully, using an acid-guanidinium-phenol- 
chloroform method similar to that described by Chomczynski et al., 1987, Anal. 
Biochem. 162, p. 156-159. Approximately 5,164 mixed instar larvae or 3,653 prepupal 
larvae were used in each RNA preparation. Poly A+ selected RNA was separated from 
each total RNA preparation by oligo-dT cellulose chromatography using 
Poly(A)Quick© mRNA isolation kits (available from Stratagene Cloning Systems, La 
Jolla, CA), according to the method recommended by the manufacturer. 

A mixed instar cDNA expression library and a prepupal cDNA expression library 
were constructed in lambda {X) Uni-ZAP™XR vector (available from Stratagene 
Cloning Systems) using Stratagene's ZAP-cDNA Synthesis Kit® protocol. About 6.34 
ug of mixed instar poly A+ RNA were used to produce the mixed instar library and 
about 6.72 ug of prepupal poly A+ RNA were used to produce the prepupal library. 
The resultant mixed instar library was amplified to a titer of about 2.17 x 10 10 pfu/ml 
with about 97% recombinants. The resultant prepupal library was amplified to a titer of 
about 3.5 x 1 0 10 pfu/ml with about 97% recombinants. 

A pair of primers was used to amplify DNA from the cDNA libraries. A sense 
vector primer T-3X (corresponding to the vector in which nucleic acid molecules of the 
present invention had been ligated), having the nucleic acid sequence AATTAACCCT 
CACTAAAGGG (available from Gibco BRL, Gaithersburg, MD; denoted SEQ ID 
NO:45), was used in combination with a degenerate primer, the design of which was 
based on a highly conserved esterase amino acid sequence (disclosed in Hanzlik et al., J. 
Biol. Chem. 264:12419-12423, 1989; IY/HGGG F/L) located in a region downstream 
from the mature amino terminus in a number of known esterases. The degenerate 
primer, referred to herein as FCEF, is an anti-sense primer having the nucleic acid 
sequence ARDCCDCCDC CRTRDAT ( R indicating an A or G; and D indicating an A, 
G or T; denoted SEQ ID NO:46). The resultant PCR products from the mixed instar 
cDNA library, obtained using standard PCR conditions (e.g., Sambrook et al., ibid.), 
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were about 550 nucleotides. The resultant PCR products from the prepupal cDNA 
library were from about 500 nucleotides to about 860 nucleotides. 
A. PCR Products. 

PCR products were gel purified and cloned into the TA Vector™ (available from 
5 InVitrogen Corp., San Diego, CA). Approximately 8 clones were identified from the 
prepupal library and 6 clones were identified from the mixed instar library. These 
nucleic acid molecules were subjected to nucleic acid sequencing using the Sanger 
dideoxy chain termination method, as described in Sambrook et al., ibid. 

1 . Flea esterase clone 1 isolated from the mixed instar cDNA library 
10 was determined to comprise nucleic acid molecule nfEl 40I , the nucleic acid sequence of 

the coding strand which is denoted herein as SEQ ID NO:l. Translation of SEQ ID 
NO:l suggests that nucleic acid molecule nfEl 401 encodes a non-full-length flea esterase 
protein of about 103 amino acids, referred to herein as PfEl 103 , having amino acid 
sequence SEQ ID NO:2, assuming an initiation codon spanning from nucleotide 92 

15 through nucleotide 94 of SEQ ID NO:l. The complement of SEQ ID NO:l is 

represented herein by SEQ ID NO:3 . Comparison of amino acid sequence SEQ ID 
NO:2 (i.e., the amino acid sequence of PfEl 103 ) with amino acid sequences reported in 
GenBank indicates that SEQ ID NO:2, showed the most homology, i.e., about 33% 
identity, between SEQ ID NO:2 and alpha esterase protein from Drosophila 

20 melanogaster. 

2. Flea esterase clone 2 isolated from the mixed instar cDNA library 
was determined to comprise nucleic acid molecule nfE2 364 , the nucleic acid sequence of 
the coding strand which is denoted herein as SEQ ID NO:4. Translation of SEQ ID 
NO:4 suggests that nucleic acid molecule nfE2 364 encodes a non-full-length flea esterase 

25 protein of about 121 amino acids, referred to herein as PfE? 12 -, having ..mino acid 
sequence SEQ ID NO:5, assuming the first codon spans from nucleotide 2 through 
nucleotide 4 of SEQ ID NO:4. The complement of SEQ ID NO:4 is represented herein 
by SEQ ID NO:6. Comparison of nucleic acid sequence SEQ ID NO:4 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO:4 showed the most 

30 homology, i.e., about 43% identity, between SEQ ID NO:4 and a H. virescens JHE gene. 
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Comparison of amino acid sequence SEQ ID NO: 5 (i.e., the amino acid sequence of 
PfE2 12 i) with amino acid sequences reported in GenBank indicates that SEQ ID NO:5, 
showed the most homology, i.e., about 38% identity, between SEQ ID NO:5 and alpha 
esterase protein from Drosophila melanogaster. 

3. Flea esterase clone 3 isolated from the prepupal cDNA library was 
determined to comprise nucleic acid molecule nfE3 421 , the nucleic acid sequence of the 
coding strand which is denoted herein as SEQ ID NO:7. Translation of SEQ ID NO:7 
suggests that nucleic acid molecule nfE3 42 , encodes a non-full-length flea esterase 
protein of about 103 amino acids, referred to herein as PfE3, 03 , having amino acid 
sequence SEQ ID NO:8, assuming an initiation codon spanning from nucleotide 1 13 
through nucleotide 1 1 5 of SEQ ID NO:7. The complement of SEQ ID NO:7 is 
represented herein by SEQ ID NO:9. Comparison of nucleic acid sequence SEQ ID 
NO:7 with nucleic acid sequences reported in GenBank indicates that SEQ ID NO:7 
showed the most homology, i.e., about 53% identity, between SEQ ID NO:7 and a 
Torpedo marmorata acetylcholinesterase gene. Comparison of amino acid sequence 
SEQ ID NO:8 (i.e., the amino acid sequence of PfE3 103 ) with amino acid sequences 
reported in GenBank indicates that SEQ ID NO:8, showed the most homology, i.e., 
about 39% identity, between SEQ ID NO:5 and alpha esterase protein from Drosophila 
melanogaster. 

4. Flea esterase clone 4 isolated from the prepupal cDNA library was 
determined to comprise nucleic acid molecule nfE4 524 , the nucleic acid sequence of the 
coding strand which is denoted herein as SEQ ID NO: 10. Translation of SEQ ID NO: 10 
suggests that nucleic acid molecule nfE4 524 encodes a non-full-length flea esterase 
protein of about 137 amino acids, referred to herein as PfE4, 37 , having amino acid 
sequence SEQ ID NO:ll, Fssuning ar initiation codon spanning from nucleotide 113 
through nucleotide 115 of SEQ ID NO:10. The complement of SEQ ID NO:10 is 
represented herein by SEQ ID NO: 12. Comparison of nucleic acid sequence SEQ ID 
NO: 10 with nucleic acid sequences reported in GenBank indicates that SEQ ID NO: 10 
showed the most homology, i.e., about 47% identity, between SEQ ID NO: 10 and an 
Anas platyrhyncos thioesterase B gene. Comparison of amino acid sequence SEQ ID 
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NO: 11 (i.e., the amino acid sequence of PfE4, 37 ) with amino acid sequences reported in 
GenBank indicates that SEQ ID NO:l 1, showed the most homology, i.e., about 30% 
identity, between SEQ ID NO:l 1 and Leptinotarsa decemlineata acetylcholinesterase. 
B. cDNA Clones. 

Certain amplified PCR fragments were used as probes to identify full-length flea 
esterase genes in the prepupal cDNA library. 

1 . Nucleic acid molecule nfE2 364 was labeled with 32 P and used as a 
probe to screen the mixed instar cDNA library described in Section A, using standard 
hybridization techniques. Two clones were isolated. A first clone included about a 
2300-nucleotide insert, referred to herein as nfE5 2300 . Nucleic acid sequence was 
obtained using standard techniques from nfE5 2300 , to yield a flea esterase nucleic acid 
molecule named nfE5, 98 2 having a nucleic acid sequence of the coding strand which is 
denoted herein as SEQ ID NO:13. Translation of SEQ ID NO:13 suggests that nucleic 
acid molecule nfE5 1982 encodes a non-full-length flea esterase protein of about 505 amino 
acids, referred to herein as PfE5 505 , having amino acid sequence SEQ ID NO: 14, 
assuming the first codon spans from nucleotide 1 through nucleotide 3 of SEQ ID 
NO: 13 and the stop codon spans from nucleotide 1518 through nucleotide 1520 of SEQ 
ID NO:13. The complement of SEQ ID NO:13 is represented herein by SEQ ID NO:15. 
The amino acid sequence of PfE5 505 (i.e., SEQ ID NO: 14) predicts that PfE5 505 has an 
estimated molecular weight of about 56.8 kD and an estimated pi of about 5.5. The 
nucleic acid molecule representing the coding region for PfE5 505 is referred to herein as 
nfE5 1515 ; the nucleic acid sequences of the coding strand and the complementary strand 
are represented by SEQ ID NO: 16 and SEQ ID NO: 17, respectively. 

The nucleic acid sequence of nfE5 1982 was used to design primers to use in 
-jombin^ion with a vector primer to PCR amplify the 5' terminal fragment of the 
remainder of the flea esterase coding region from the flea mixed instar cDNA library. A 
pair of primers was used to amplify DNA from the cDNA library. A sense vector primer 
T3-X (corresponding to the vector in which nucleic acid molecules of the present 
invention had been ligated), having the nucleic acid sequence 5' AATTAACCCT 
CACTAAAGGG 3' (denoted SEQ ID NO:45), was used in combination with an anti- 
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sense primer M6/M265', having the nucleic acid sequence 5' GTGCGTACAC 
GTTTACTACC 3' (denoted SEQ ID NO:56). The resultant PCR product from the 
mixed instar cDNA library, obtained using standard PCR conditions (e.g., Sambrook et 
al., ibid.), were about 354 nucleotides. 

The PCR product was subjected to DNA sequencing analysis, and a composite 
sequence representing a full-length flea esterase coding region was deduced. The 
nucleic acid sequence of the composite nucleic acid molecule, referred to herein as 
nfE5 2l4 4 is denoted herein as SEQ ID NO:57. Translation of SEQ ID NO:57 suggests 
that nucleic acid molecule nfE5 2 M 4 encodes a full-length flea esterase protein of about 
550 amino acids, referred to herein as PfE5 550 , having amino acid sequence SEQ ID 
NO:58, assuming an open reading frame in which the initiation codon spans from 
nucleotide 30 through nucleotide 32 of SEQ ID NO:57 and the stop codon spans from 
nucleotide 1680 through nucleotide 1682 of SEQ ID NO:57. The complement of SEQ 
ID NO:57 is represented herein by SEQ ID NO:59. The coding region encoding PfE5 550 
is represented by the nucleic acid molecule nfE5 I650 , having a coding strand with the 
nucleic acid sequence represented by SEQ ID NO:60 and a complementary strand with 
nucleic acid sequence SEQ ID NO:61. The amino acid sequence of PfE5 550 (i.e., SEQ 
ID NO:58) predicts that PfE5 550 has an estimated molecular weight of about 61 .8 kD and 
an estimated pi of about 5.5. 

Comparison of nucleic acid sequence SEQ ID NO:57 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO:57 showed the most 
homology, i.e., about 41% identity, between SEQ ID NO:57 and a M persicae esterase 
FE4 mRNA sequence. Comparison of amino acid sequence SEQ ID NO:58 (i.e., the 
amino acid sequence of PfE5 550 ) with amino acid sequences reported in GenBank 
indicates that SEQ ID NO:58 showed the most homology, i.e., about 36% identity 
between SEQ ID NO:58 and Drosophila melanogster alpha esterase protein. 

A second clone included about a 1900 nucleotide insert, referred to herein as 
nfE6i 900 . Nucleic acid sequence was obtained using standard techniques from nfE6, 900 , 
to yield a flea esterase nucleic acid molecule named nfE6, 79 2 having a nucleic acid 
sequence of the coding strand which is denoted herein as SEQ ID NO: 18. Translation of 
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SEQ ID NO: 18 suggests that nucleic acid molecule nfE6 1792 encodes a full-length flea 
esterase protein of about 550 amino acids, referred to herein as P£E6 5 5 0 , having amino 
acid sequence SEQ ID NO: 19, assuming an open reading frame in which the initiation 
codon spans from nucleotide 49 through nucleotide 51 of SEQ ID NO: 18 and a stop 
codon spanning from nucleotide 1699 through nucleotide 1701 of SEQ ID NO: 18. The 
complement of SEQ ID NO: 1 8 is represented herein by SEQ ID NO:20. The coding 
region encoding PfE6 550 , is represented by nucleic acid molecule nfE6 I05 o, having a 
coding strand with the nucleic acid sequence represented by SEQ ID NO:21 and a 
complementary strand with nucleic acid sequence SEQ ID NO:22. The proposed mature 
protein, denoted herein as PfE6 53 o, contains about 530 amino acids which is represented 
herein as SEQ ID NO:53. The nucleic acid molecule encoding PfE6 530 is denoted herein 
as niE6 1590 and has a coding strand having the nucleic acid sequence SEQ ID NO:23. 
The amino acid sequence of PfE6 5S0 (i.e., SEQ ID NO: 19) predicts that P£E6 SS0 has an 
estimated molecular weight of about 61.8 kD and an estimated pi of about 5.5. 

Comparison of nucleic acid sequence SEQ ID NO: 18 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO: 18 showed the most 
homology, i.e., about 41% identity, between SEQ ID NO:18 and a Myzus pericae 
esterase gene. Comparison of amino acid sequence SEQ ID NO: 19 (i.e., the amino acid 
sequence of PfE6j 50 ) with amino acid sequences reported in GenBank indicates that SEQ 
ID NO:19 showed the most homology, i.e., about 28% identity between SEQ ID NO:19 
and Drosophila melanogaster alpha esterase protein. 

2. Nucleic acid molecule nfE4 524 was labeled with 32 P and used as a 
probe to screen the prepupal cDNA library described in Example 11, using standard 
hybridization techniques (e.g., Sambrook et al., ibid). Two clones were isolated. A first 
clone included about a 3000 nucleotide insert, referred to herein as nfE7 3000 - Nucleic 
acid sequence was obtained using standard techniques from nfE7 3000 , to yield a flea 
esterase nucleic acid molecule named nfE7 2836 having a nucleic acid sequence of the 
coding strand which is denoted herein as SEQ ID NO:24. Translation of SEQ ID NO:24 
suggests that nucleic acid molecule nfE7 2836 encodes a full-length flea esterase protein of 
about 596 amino acids, referred to herein as PfE7 596 , having amino acid sequence SEQ 
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ID NO:25, assuming an open reading frame in which the initiation codon spans from 
nucleotide 99 through nucleotide 101 of SEQ ID NO:24 and a stop codon spanning from 
nucleotide 1887 through nucleotide 1889 of SEQ ID NO:25. The complement of SEQ 
ID NO:24 is represented herein by SEQ ID NO:26. The coding region encoding PfE7 596 , 
is represented by nucleic acid molecule nfE7 l788 , having a coding strand with the nucleic 
acid sequence represented by SEQ ID NO:28 and a complementary strand with nucleic 
acid sequence SEQ ID NO:29. The proposed mature protein, denoted herein as PfE7 570 , 
contains about 570 amino acids which is represented herein as SEQ ID NO:54. The 
nucleic acid molecule encoding PfE7 570 is denoted herein as nfE7 l710 and has a coding 
strand having the nucleic acid sequence SEQ ID NO:27. The amino acid sequence of 
PfE7 596 (i.e., SEQ ID NO:25) predicts that PfE7 596 has an estimated molecular weight of 
about 68.7 kD and an estimated pi of about 6.1 . 

Comparison of nucleic acid sequence SEQ ID NO:24 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO:24 showed the most 
homology, i.e., about 48% identity, between SEQ ID NO:24 and an Anas platychyncos 
thioesterase B gene. Comparison of amino acid sequence SEQ ID NO:25 (i.e., the 
amino acid sequence of PfE7 596 ) with amino acid sequences reported in GenBank 
indicates that SEQ ID NO:25 showed the most homology, i.e., about 27% identity 
between SEQ ID NO:25 and Drosophila melanogaster alpha esterase protein. 

A second clone included about a 3000 nucleotide insert, referred to herein as 
nfE8 300 o- Nucleic acid sequence was obtained using standard techniques from nfES 3000 , 
to yield a flea esterase nucleic acid molecule named nfE8 28 oi having a nucleic acid 
sequence of the coding strand which is denoted herein as SEQ ID NO:30. Translation of 
SEQ ID NO:30 suggests that nucleic acid molecule nfE8 280I encodes a full-length flea 
esterase protein of about 59d amino acids, referred to herein as PfE8 595 , h? 'ir g amino 
acid sequence SEQ ID NO:31, assuming an open reading frame in which the initiation 
codon spans from nucleotide 99 through nucleotide 101 of SEQ ID NO:30 and a stop 
codon spanning from nucleotide 1884 through nucleotide 1886 of SEQ ID NO:30. The 
complement of SEQ ID NO:30 is represented herein by SEQ ID NO:32. The coding 
region encoding PfE8 595 , is represented by nucleic acid molecule nfE8 l785 , having a 
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coding strand with the nucleic acid sequence represented by SEQ ID NO:34 and a 
complementary strand with nucleic acid sequence SEQ ID NO:35. The proposed mature 
protein, denoted herein as PfE8 570 , contains about 570 amino acids which is represented 
herein as SEQ ID NO:55. The nucleic acid molecule encoding PfE8 570 is denoted herein 
as nfE8 l7I0 and has a coding strand having the nucleic acid sequence SEQ ID NO:33. 
The amino acid sequence of PfE8 595 (i.e., SEQ ID NO:31) predicts that PfE8 595 has an 
estimated molecular weight of about 68.6 kD and an estimated pi of about 6.1. 

Comparison of nucleic acid sequence SEQ ID NO:30 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO:30 showed the most 
homology, i.e., about 46% identity, between SEQ ID NO:30 and a Mas musculus 
carboxyl ester lipase gene. Comparison of amino acid sequence SEQ ID NO:31 (i.e., the 
amino acid sequence of PfE8 595 ) with amino acid sequences reported in GenBank 
indicates that SEQ ID NO:31 showed the most homology, i.e., about 28% identity 
between SEQ ID NO:31 and estalpha-2 esterase of Cidex pipiens quinquefasciatus. 

3. Nucleic acid molecule nfE3 421 was labeled with 32 P and used as a 
probe to screen the prepupal cDNA library using standard hybridization techniques (e.g., 
Sambrook et al., ibid.). Two clones were isolated. One clone included about a 1900 
nucleotide insert, referred to herein as nfE9 1900 . Nucleic acid sequence was obtained 
using standard techniques from nfE9 1900 , to yield a flea esterase nucleic acid molecule 
named nfE9 200 7 having nucleic acid sequence of the coding strand which is denoted 
herein as SEQ ID NO:36. Translation of SEQ ID NO:36 suggests that nucleic acid 
molecule nfE9 200 7 encodes a full-length flea esterase protein of about 528 amino acids, 
referred to herein as PfE9 528 , having amino acid sequence SEQ ID NO:37, assuming an 
open reading frame in which the initiation codon spans from nucleotide 1 1 through 
nucleotide 13 of SEQ ID NO:36 and a stop co Jon spanning from nucleotide 1595 
through nucleotide 1597 of SEQ ID NO:36. The complement of SEQ ID NO:36 is 
represented herein by SEQ ID NO:38. The coding region encoding PfE9 528 , is 
represented by nucleic acid molecule nfE9 1584 , having a coding strand with the nucleic 
acid sequence represented by SEQ ID NO:51 and a complementary strand with nucleic 
acid sequence SEQ ID NO:52. The amino acid sequence of PfE9 528 (i.e., SEQ ID 
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NO:37) predicts that PfE9 528 has an estimated molecular weight of about 60 kD and an 
estimated pi of about 5.43. 

Comparison of nucleic acid sequence SEQ ID NO:36 with nucleic acid 
sequences reported in GenBank indicates that SEQ ID NO:36 showed the most 
5 homology, i.e., about 47% identity, between SEQ ID NO:36 and a hamster mRNA for 
carboxylesterase precursor gene. Comparison of amino acid sequence SEQ ID NO:37 
(i.e., the amino acid sequence of PfE9 S28 ) with amino acid sequences reported in 
GenBank indicates that SEQ ID NO:37 showed the most homology, i.e., about 37% 
identity between SEQ ID NO:37 and alpha esterase protein from Drosophila 

1 0 melanogaster. 

As is the case for any of the nucleic acid molecules described in this example, 
variations between sequences may be due to a number of factors, such as but not limited 
to, sequencing errors or allelic variation. 

4. Nucleic acid molecule niEl 40 i was labeled with 32 P and used as a 
1 5 probe to screen the mixed instar cDNA library using standard hybridization techniques 
(e.g., Sambrook et al., ibid.). A clone was isolated that included about a 2000 nucleotide 
insert, referred to herein as nfE10 200 o. Nucleic acid sequence was obtained using 
standard techniques from nfE10 20 oo, to yield a flea esterase nucleic acid molecule named 
nfEKW having nucleic acid sequence of the coding strand which is denoted herein as 
20 SEQ ID NO:67. Translation of SEQ ID NO:67 suggests that nucleic acid molecule 

nfEl (W encodes a full-length flea esterase protein of about 530 amino acids, referred to 
herein as Pf£10 53 o, having amino acid sequence SEQ ID NO:68, assuming an open 
reading frame in which the initiation codon spans from nucleotide 231 through 
nucleotide 233 of SEQ ID NO:67 and a stop codon spanning from nucleotide 1821 
25 through nuc! -o* :de 1 823 of SEQ ID NO:67. The complement of SEQ ID NO:67 is 
represented herein by SEQ ID NO:69. The coding region encoding PfE10 530 , is 
represented by nucleic acid molecule nfE10 1590 , having a coding strand with the nucleic 
acid sequence represented by SEQ ID NO:70 and a complementary strand with nucleic 
acid sequence SEQ ID NO:71. The amino acid sequence of PfE10 53 o(i.e., SEQ ID 
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NO:68) predicts that PfE10 530 has an estimated molecular weight of about 59.5 kD and 

an estimated pi of about 5.5. 

Comparison of nucleic acid sequence SEQ ID NO:67 with nucleic acid 

sequences reported in GenBank indicates that SEQ ID NO:67 showed the most 
5 homology, i.e., about 48% identity, between SEQ ID NO:67 and a Lucilia cuprina alpha 

esterase gene (genembl # U56636) gene. Comparison of amino acid sequence SEQ ID 

NO:68 (i.e., the amino acid sequence of PfE10 530 ) with amino acid sequences reported in 

GenBank indicates that SEQ ID NO:68 showed the most homology, i.e., about 30% 

identity between SEQ ID NO:68 and Culex pipens esterase bl precurser protein 
10 (swissprot#P16854). 

As is the case for any of the nucleic acid molecules described in this example, 

variations between sequences may be due to a number of factors, such as but not limited 

to, sequencing errors or allelic variation. 

Example 12 

15 This Example demonstrates the production of esterase proteins of the present 

invention in E. coli cells. 

A. Flea esterase protein PHIS-PfE7 570 and flea esterase protein PHIS-PfES 570 
were produced in the following manner. A pair of primers was used to amplify DNA 
from flea esterase nucleic acid molecule nfE7 2S36 or nfE8 2SO , produced as described in 
20 Example 1 1 . A sense primer containing an XIwl site (shown in bold) having the nucleic 
acid sequence 5' TGTGCTCGAG ATGGGATAAC CTAGATCAGC ATTTGTGC 3' 
(denoted SEQ ID NO:47), was used in combination with an anti-sense primer containing 
a Kpnl site (shown in bold) having the nucleic acid sequence 5' TTAAGGTACC 
TCATCTAATA CTTCCTTCAT TACAG 3' (denoted SEQ ID NO:48). A PCR product 
25 was derived from nfE7 2836 , and is referred to herei r j nfE7 1710 , having nucleic acid 
sequence SEQ ID NO:27. The PCR product was digested with XJiol and Kpnl 
restriction endonucleases, gel purified and subcloned into expression vector pTrcHisB 
(available from InVitrogen). The resultant recombinant molecule, referred to herein as 
P Trc-nfE7 m0 , was transformed into E. coli HB101 competent cells (available from 
30 Gibco BRL) to form recombinant cell E. co//:pTrc-nfE7 1710 . 
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The PCR product derived from nfE8 2g0 | using the primers is referred to herein as 
nfE8 mo , having nucleic acid sequence SEQ ID NO:33. PCR product nfE8 mo was 
digested with A7;oI and Kpnl restriction endonucleases, gel purified and subcloned into 
expression vector pTrcHisB. The resultant recombinant molecule, referred to herein as 
5 P Trc-nfE8 1710 , was transformed into E. coli HB 101 competent cells to form recombinant 
cell E. co//:pTrc-nfE8, 710 . 

The recombinant cells were cultured in enriched bacterial growth medium 
containing 0.1 mg/ml ampicillin and 0.1% glucose at about 32°C. When the cells 
reached an OD 000 of about 0.4-0.5, expression of recombinant protein was induced by the 
1 0 addition of 0.5 mM isopropyl-B-D-thiogalactoside (IPTG), and the cells were cultured 
for about 2 hours at about 32°C. Immunoblot analysis of recombinant cell E. co//:pTrc- 
nfE7 1710 and E. co//:pTrc-nfE8 1710 lysates using a T7 tag monoclonal antibody (available 
from Novagen, Inc., Madison, WI) directed against the fusion portion of the recombinant 
PHIS-PfE7 570 and PHIS-PfE8 570 fusion proteins identified proteins of appropriate size, 
15 namely an about 65 kD protein for each fusion protein. 

B. Flea esterase protein PHIS-PfE6 540 was produced in the following 
manner. A pair of primers was used to amplify DNA from flea esterase nucleic acid 
molecule nfE6 1792 produced as described in Example 1 1. A sense primer containing an 
A7;oI site having the nucleic acid sequence 5' AAACTCGAGT CCCCCGACTG 
20 TAACTTTGC 3' (denoted SEQ ID NO:62; XIiol site shown in bold), was used in 
combination with an anti-sense primer containing a Pstl site having the nucleic acid 
sequence 5 1 TCATCTGCAG TTATTGACTG TGCAAAGTTT TTGTGG 3' (denoted 
SEQ ID NO:63; Pstl site shown in bold). A PCR product was derived from nfE6 1792 , 
and is referred to herein as nfE6 14g8 , having nucleic acid sequence SEQ ID NO:76. The 
25 PCR product was digested with Xliol and Pstl restriction endonvcleases, gel purified and 
subcloned into expression vector lambdaP R /TWS10HIS-RSET-A9, that had been 
digested with A7;oI and Pstl and dephosphorylated.. The resultant recombinant 
molecule, referred to herein as P Cro-nfE6 1488 , was transformed into E. coli HB101 
competent cells (available from Gibco BRL) to form recombinant cell E. co/r.pCro- 
30 nfE6 l488 . 
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The recombinant cells were cultured using the method generally described in 
Section A of this example, except that the cells were grown under heat shift conditions 
rather than in the presence of IPTG. The cells were grown at 32°C for about 2 hours, 
and then grown at 42°C. Immunoblot analysis of recombinant cell E. co//:pCro-nfE6 14S8 

5 lysate using a T7 tag monoclonal antibody directed against the fusion portion of the 
recombinant PHIS-PfE6 540 fusion protein identified proteins of appropriate size, namely 
an about 60 kD protein for each fusion protein. 

Expression of the recombinant PHIS-Pffi6 540 fusion protein was improved by 
transforming supercoiled plasmid pCro-nfE6, 488 DNA harvested from E. co//':pCro- 

1 0 nfE6 I488 cells into the BL-21 strain of E. coli (available from Novagen). The amount of 
expression PHIS-;PfE65 40 was confirmed by immunoblot using the method described 
immediately above. 

E. coli cells expressing PHIS-PfE6 540 protein were harvested from about 2 liters 
of media and suspended in about 140 ml of 50 mM Tris, pH 8.0, 50 mM NaCl, 0.1 mM 

15 phenylmethylsulfonylfluoride (PMSF) (Solubilization Buffer). The cells were broken 
by passage through a microfluidizer at 30 psi for 30 cycles. The sample was centrifuged 
at about 16,000 X g for 30 min at 4°C. The supernatant (SI) was recovered and the 
pellet was resuspended in about 80 ml of Solubilization Buffer and centrifuged at about 
16,000 X g for 30 min at 4°C. The supernatant (S2) was recovered and the pellet was 

20 resuspended in about 80 ml of Solubilization Buffer containing 0.1% Triton-XlOO and 
centrifuged at about 16,000 X g for 30 min at 4°C. The supernatant (S3) was recovered 
and the pellet was resuspended in about 140 mis 50 mM Tris, pH 8.0, 8 M Urea, 0.1 M 
PMSF and centrifuged at about 16,000 X g. The supernatant (S4) was recovered and the 
pellet was resuspended in 40 mis 50 mM Tris, 8 M Urea, 0.1 M PMSF. Aliquots of each 

25 pellet and supernatant were analyz ^d >y SDS-PAGE and immunoblot using the T7 tag 
monoclonal antibody described above. The results indicated that the PHIS-PfE6 540 
protein was located in the final supernatant (S4). The PHIS-PfE6 540 protein was loaded 
onto a 5.0 ml, Metal chelating HiTrap column charged with NiCl 2 (obtained from 
Pharmacia Biotech Inc., Piscataway, NJ), previously equilibrated with 50 mM Tris, 1 
30 mM PMSF, 1 mM p-mercaptoethanol (PME), 8 M urea, pH 8.0 (Buffer A). The column 
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was washed with 10 column volumes (cv) of Buffer A and then with 10 cv with 50 mM 
Tris, 25 mM sodium acetate, 1 mM PMSF, 1 mM pME, 8 M urea, pH 6.0 (Buffer B) to 
remove loosely bound proteins. Bound PHIS-PfE6 540 protein was eluted with 10 cv of 
50 mM Tris, 25 mM sodium acetate, 1 mM PMSF, 1 mM PME, 8 M urea, pH 4.0 
5 (Buffer C). Column fractions were analyzed for the presence of PHIS-PfE6 540 protein by 
immunoblot using the T7 tag monoclonal antibody as described above. The results 
indicated that the majority of the PHIS-PfE6 540 protein was eluted by Buffer C. The 
fractions containing the PHIS-PfE6 540 protein were combined and loaded onto a 5 ml SP- 
Sepharose HiTrap column (obtained from Pharmacia Biotech Inc.) previously 

10 equilibrated with 50 mM Tris, 25 mM Sodium Acetate, 1 mM PMSF, 1 mM PME, S M 
Urea, pH 4.5 (SP-Sepharose Buffer). The column was washed with SP-Sepharose 
Buffer until most of the unbound protein was removed. Bound protein was eluted with 
an increasing salt gradient to 1 M NaCl over 100 ml (20 cv) in SP-sepharose buffer. 
Column fractions were analyzed for the presence of PHIS-PfE6 540 protein by 

15 immunoblot using the T7 tag monoclonal antibody as described above. The results 
indicated that the PHIS-PfE6 540 protein was eluted at about 0.75 M NaCl. 

The purified PHIS-PfE6 I48S protein was used to produce an anti-M6 polyclonal 
antiserum as follows. Rabbits were immunized with PHIS-PfE6 l488 protein diluted to a 
concentration of about 0.1 mg/ml in PBS. One milliliter of the dilution was mixed 1:1 

20 mix with Complete Freunds Adjuvant. In the primary immunization, about 500 ^1 of the 
1:1 mix was injected subcutaneously into 5 different sites (0.1 ml/site) and 500 (il was 
injected intradermally into 5 different sites (0.1 ml/site) on the rabbit. Booster shots 
were administered to the rabbit intramuscularly in 4 sites using 250 |il/site of a 1:1 mix 
of PHIS-PfE6 I4g8 protein with Incomplete Freunds Adjuvant. The booster shots were 

25 adrr mastered at days 14 and 35. Serum samples were obtained prior to immunization 
(pre-bleed), and at day 14 after primary immunization and day 14 after the first and 
second boost. 

C. Flea esterase protein PHIS-PfE9 528 was produced in the following 
manner. A pair of primers was used to amplify DNA from flea esterase nucleic acid 
30 molecule nfE9 200 7 produced as described in Example 1 1 . A sense primer containing an 
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BamUl site having the nucleic acid sequence 5' -TTC CGG ATC CGG CTG ATC TAC 
AAG TGA CTT TG - 3' (denoted SEQ ID NO:64; BamKl site shown in bold), was used 
in combination with an anti-sense primer containing a XJwl site having the nucleic acid 
sequence 5' TGG TAC TCG AGT CAT AAA AAT TTA TTC CAA AAT C 3' (denoted 
SEQ ID NO:65; Xlwl site shown in bold). A PCR product was derived from nfE9 200 7, 
and is referred to herein as nfE9 1540) having nucleic acid sequence SEQ ID NO:5 1 . The 
PCR product was digested with Band and Xliol restriction endonucleases, gel purified 
and subcloned into expression vector pTrcHisB (available from InVitrogen). The 
resultant recombinant molecule, referred to herein as P Trc-nfE9 1540 , was transformed into 
E. coli HB101 competent cells (available from Gibco BRL) to form recombinant cell E. 
co//:pTrc-nfE9 1 540- 

The recombinant cells were cultured using the method described in Section A of 
this example. Immunoblot analysis of recombinant cell E. co/;:pTrc-nfE9 1540 lysate 
using a T7 tag monoclonal antibody directed against the fusion portion of the 
15 recombinant PHIS-PfE9 52S fusion protein identified proteins of appropriate size, namely 
an about 59 kD protein for each fusion protein. 

Expression of the recombinant PHIS-PfE9 528 fusion protein was improved by 
transforming supercoiled plasmid P Trc-nfE9 1584 DNA harvested from E. co//:pTrc- 
nfE9 1540 cells into the BL-21 strain of E. coli. The amount of expression PHIS-PfE9 52S 
20 was confirmed by immunoblot using the method described immediately above. 

Two liters of media from cultures of E. coli cells expressing PHIS-PfE9 528 
protein were harvested and S4 supernatant was prepared using the method described 
above in section B. The PHIS-PfE9 528 protein contained in the S4 supernatant was 
loaded onto a 5.0 ml, Metal chelating HiTrap column, charged with NiCl 2 (available 
25 from Pharmacia Biotech Inc., Piscataway, NJ), previously equilibrated with 50 mM Tris, 
1 mM PMSF, 1 mM PME, 8 M urea, P H 8.U (Buffer A). The column was washed with 
5 cv of Buffer A until all unbound protein was removed. Bound protein was eluted with 
a linear gradient from Buffer A to 50 mM Tris, 1 mM PMSF, 1 mM PME, S M urea, 1 
M NaCl, pH 4.0. Column fractions were analyzed for the presence of PHIS-PfE9 52S 
30 protein by immunoblot using the T7 tag monoclonal antibody as described above. The 
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results indicated that the majority of the PHIS-PfE9 S28 protein was eluted at about 250 
mM NaCl. The fractions containing the PHIS-PfE9 528 protein were combined and 
loaded onto a C4-reversed phase column (obtained from Vydak, Hesperia, CA), 
previously equilibrated with 0.05% trifluoroacetic acid (TFA). The column was washed 
with 0.05% TFA until all unbound protein was removed. Bound proteins were eluted 
with a linear gradient from 0.05% TFA to 0.05% TFA in acetonitrile. Column fractions 
were analyzed for the presence of PHIS-PfE9 528 protein by immunoblot using the T7 tag 
monoclonal antibody as described above. The results indicated that the PHIS-PfE9 528 
protein was eluted at about 40% acetonitrile. The fractions containing the PHIS-PfE9 528 
protein were combined and loaded onto a 5 ml Q-Sepharose HiTrap column previously 
equilibrated with 50 mM Tris, 25 mM Sodium Acetate, 1 mM PMSF, 1 mM PME, S M 
Urea, pH 8.5 (Q-Sepharose Buffer). The column was washed with Q-Sepharose Buffer 
until all unbound protein was removed. Bound protein was eluted with an increasing 
salt gradient to 1 M NaCl over 100 ml (20 cv) in Q-sepharose buffer. Column fractions 
were analyzed for the presence of PHIS-PfE9 528 protein by immunoblot using the T7 tag 
monoclonal antibody as described above. The results indicated that the PHIS-PfE9 528 
protein was eluted at about 0.3 M NaCl. 

The purified PHIS-PfE9 528 protein was used to produce an anti-Pi polyclonal 
antiserum as follows. Rabbits were immunized with PHIS-PfE9 528 protein diluted to a 
concentration of about 0. 1 mg/ml in PBS. One milliliter of the dilution was mixed 1 : 1 
mix with Complete Freunds Adjuvant. In the primary immunization, about 500 pi of the 
1:1 mix was injected subcutaneously into 5 different sites (0.1 ml/site) and 500 pi was 
injected intradermal^ into 5 different sites (0.1 ml/site) on the rabbit. Booster shots 
were administered to the rabbit intramuscularly in 4 sites using 250 ul/site of a 1:1 mix 
of PHIS-PfE9 528 protein with Inc -irplete Freunds Adjuvant The booster shots were 
administered at days 14 and 35. Serum samples were obtained prior to immunization 
(pre-bleed), and at day 14 after primary immunization and day 14 after the first and 
second boost. 

D. Flea esterase protein PHIS-PfE7 275 was produced in the following 
manner. A 650-bp fragment was produced by digesting nfE7 2836 DNA with the 
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restriction enzymes BamUl and Bglll. The BamYLl and Bglll fragment derived from 
nfE7 2g36 is referred to herein as nfE7 650 , having nucleic acid sequence SEQ ID NO:72 
and amino acid SEQ ID NO:73. The fragment was purified using a Qiaquick™ Kit 
(available from Qiagen, Santa Clarita, CA), according to methods provided by the 

5 manufacturer. The purified fragment was subcloned into expression vector pTrcHisC 
which had been digested with BamMl and Bglll. The resultant recombinant molecule, 
referred to herein as pTrc-nfE7 6S0 was transformed into E. coli DH-5a competent cells 
(available from Gibco BRL) to form recombinant cell E. co//:pTrc-nfE7 050 . 

The recombinant cells were cultured using the method described above in section 

10 A. Immunoblot analysis of recombinant cell E. co//:pTrc-nfE7 650 lysate using a T7 tag 
monoclonal antibody directed against the fusion portion of the recombinant PHIS- 
PfE7 275 fusion protein identified proteins of appropriate size, namely an about 35 kD 
protein for each fusion protein. 

Expression of the recombinant fusion protein was improved by transforming 

15 supercoiled plasmid pTrc-nfE7 650 DNA harvested fronv£. co//:pTrc-nfE7 650 cells into the 
BL-21 strain of E. coli. The amount of expression E. co//:pTrc-nfE7 650 was confirmed 
by immunoblot using the method described immediately above. 
Example 13 . 

This Example demonstrates the production of esterase proteins of the present 

20 invention in eukaryotic cells. 

A. Recombinant molecule pBv-nfE7 178g , containing a flea esterase nucleic 
acid molecule spanning nucleotides from about 99 through about 1886 of SEQ ID 
NO:24, and pBv-nfE8 17gs , containing a flea esterase nucleic acid molecule spanning 
nucleotides from about 99 through about 1883 of SEQ ID NO:30 each, operatively 

25 linkeu to baculovirus poh he Iron transcription control sequences were produced in the 
following manner. In order to subclone a flea esterase nucleic acid molecule into 
baculovirus expression vectors, flea esterase nucleic acid molecule-containing fragments 
were separately PCR amplified from nfE7 2g36 or nfE8 2S01 DNA. A PCR fragment of 
1858 nucleotides, named nfE7 lg58 , was amplified from nfE7 2g36 using a sense primer 

30 El 1 13 FWD having the nucleic acid sequence 5'- AAAACTGCAG TATAAATATG 
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TTACCTCACA GTAGTG - 3' (SEQ ID NO:49; Pstl site shown in bold) and an 
antisense primer E 1 1 13/2212 REV having the nucleic acid sequence 5*- 
TGCTCTAGAT TATCTAATAC TTCCTTCATT ACAG (SEQ ID NO:50; Xbal site 
shown in bold). A PCR fragment of 1858 nucleotides, named nfE8 1858 , was amplified 
from nfE8 28 oi using a sense primer E2212 FWD having the nucleic acid sequence 5'- 
AAAACTGCAG TATAAATATG TTACCTCACA GTGCATTAG -3' (SEQ ID 
NO:66; Pstl site shown in bold), and the antisense primer E 1 1 13/2212 REV. The N- 
terminal primer was designed from the pol h sequence of baculovirus with modifications 
to enhance expression in the baculovirus system. 

In order to produce a baculovirus recombinant molecule capable of directing the 
production of PfE7 5% , the about 1,802 base pair PCR product (referred to as Bv-nfE7 IS02 ) 
was digested with Pstl and Xbal and subcloned into unique Pstl and Xbal sites of 
pVL1392 baculovirus shuttle plasmid (available from Pharmingen, San Diego, CA) to 
produce the recombinant molecule referred to herein as pVL-nfE7 lg02 . 

In order to produce a baculovirus recombinant molecule capable of directing the 
production of PfE8 595 , the about 1,792 base pair PCR product (referred to as Bv-nfES 1792 ) 
was digested with Pstl and Xbal and subcloned into Pstl and Xbal digested to produce 
the recombinant molecule referred to herein as pVL-nfE8 1792 . 

The resultant recombinant molecules, pVL-nfE7 I802 and pVL-nfE8 1792 , were 
verified for proper insert orientation by restriction mapping. Such a recombinant 
molecule can be co-transfected with a linear Baculogold baculovirus DNA (available 
from Pharmingen) into S.frugiperda Sf9 cells (available from InVitrogen) to form the 
recombinant cells denoted S. frugiperda:pVL-n{E7 m2 and S.frugiperda:pVL-fE8 n92 . S. 
fmgiperda:pVL-nfE7 m2 can be cultured in order to produce a flea esterase protein 
PfE7 596 . S.frugiperda:pVL-nfE8 X792 can be cultured in order to produce a flea esterase 
protein PfE8 595 . 

B. Recombinant molecule pBv-PfE9 528 , containing a flea esterase nucleic 
acid molecule spanning nucleotides from 14 through 1595 of SEQ ID NO:36, 
operatively linked to baculovirus polyhedron transcription control sequences were 
produced in the following manner. In order to subclone a flea esterase nucleic acid 
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molecule into baculovirus expression vectors, a flea esterase nucleic acid molecule- 
containing fragment was PCR amplified from nfE9 2007 DNA. A PCR fragment of about 
1600 nucleotides, named nfE9 1600 , was amplified from nfE9 2007 using a sense primer 
P121B1 Sense having the nucleic acid sequence 5'- CGC GGA TCC GCT GAT CTA 
CAA GTG ACT TTG C - 3' (SEQ ID NO:75; BamUI site shown in bold) and an 
antisense primer P121B1 Anti having the nucleic acid sequence 5'- CCG AGC GGC 
CGC ATA AAA ATT TAT TCC AAA ATC TAA GTC G-3' (SEQ ID NO:76; Notl site 
shown in bold). The N-terminal primer was designed from the pol h sequence of 
baculovirus with modifications to enhance expression in the baculovirus system. 

In order to produce a baculovirus recombinant molecule capable of directing the 
production of PfE9 52S , the about 1,600 base pair PCR product (referred to as Bv-nfE9 J600 ) 
was digested with BomHl and Notl and subcloned into unique BamUI and Notl sites of 
pVL1393 baculovirus shuttle plasmid (available from Pharmingen, San Diego, CA) to 
produce the recombinant molecule referred to herein as pVL-nfE9, 600 . 

The resultant recombinant molecule, pVL-nfE9, 600 , was verified for proper insert 
orientation by restriction mapping. Such a recombinant molecule can be co-transfected 
with a linear Baculogold baculovirus DNA into S.fmgiperda Sf9 cells to form the 
recombinant cells denoted S.frugiperda:pVL-x\{E9 lG00 . S.frugiperda:pVL-nfE9 {00 o can 
be cultured in order to produce a flea esterase protein PfE9 528 . 

An immunoblot of supernatant from cultures of S.frugiperda:pVL-nfE9 xm cells 
producing the flea esterase protein PfE9 528 was performed using the anti-Pi polyclonal 
antiserum described in detail in Example 12. Blots were incubated using serum samples 
from the pre-bleed or from serum collected 14 days after the first boost of the rabbit. 
Analysis of the supernatent from cultures of S.frugiperda:pVL-nfE9 l600 cells identified 
an about 66 kD protein 

C. Recombinant molecule pBv-PfE6 530 , containing a flea esterase nucleic 
acid molecule spanning nucleotides from 50 through 1701 of SEQ ID NO:18, 
operatively linked to baculovirus polyhedron transcription control sequences were 
produced in the following manner. In order to subclone a flea esterase nucleic acid 
molecule into baculovirus expression vectors, a flea esterase nucleic acid molecule- 
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containing fragment was PCR amplified from nfE6, 792 DNA. A PCR fragment of about 
1679 nucleotides, named nfE10 1679 , was amplified from nfE6 l792 using a sense primer 
M6M32 Sense having the nucleic acid sequence 5'- GCG AGG CCT TAT AAA TAT 
GTC TCG TGT TAT TTT TTT AAG TTG - 3' (SEQ ID NO:75; Stul site shown in 
bold) and an antisense primer M6M32 Anti having the nucleic acid sequence 5'- GCA 
CTG CAG TTA TTG ACT GTG CAA AGT TTT TGT GG-3' (SEQ ID NO:76; Pstl 
site shown in bold). The N-terminal primer was designed from the pol h sequence of 
baculovirus with modifications to enhance expression in the baculovirus system. 

In order to produce a baculovirus recombinant molecule capable of directing the 
production of PfE6 530 , the about 1,679 base pair PCR product (referred to as Bv-nfE6i 679 ) 
was digested with Stul and Pstl and subcloned into unique Stul and Pstl sites of FAST 
BAC™ baculovirus shuttle plasmid (obtained from Gibco-BRL) to produce the 
recombinant molecule referred to herein as pFB-nfE6 I679 . 

The resultant recombinant molecule, pFB-nfE6i 679 , was verified for proper insert 
orientation by restriction mapping. Such a recombinant molecule can be transformed 
into E. coli strain DH10 (obtained from Gibco-BRL) according to the manufacturer's 
instructions. The pFB-nfE6 l679 isolated from the transformed DH10 cells can then be co- 
transfected with a linear Baculogold baculovirus DNA into S. frugiperda Sf9 cells to 
form the recombinant cells denoted S. frugiperda-.pFB-nfEG^. S.frugiperda:pFB- 
nfE6 1679 can be cultured in order to produce a flea esterase protein PfE6 530 . 

An immunoblot of supernatant from cultures of S. frugiperda :pFB-nfE6i 079 cells 
producing the flea esterase protein PfE6 530 was performed using the anti-M6 polyclonal 
antiserum described in detail in Example 12. Blots were incubated using serum samples 
from the pre-bleed or from serum collected 14 days after the first boost of the rabbit. 
Analysis of the supernatent fro*n cultures of S.f'u ;/>e/Yfo:pFB-nfE6 1679 cells identified 
an about 66 kD protein. 

N-terminal amino acid sequence was obtained using standard methods for the 
about 66 kD protein identified using the anti-M6 polyclonal antiserum. The N-terminal 
amino acid sequence was determined to be identical to the N-terminal amino acid 
sequence of SEQ ID NO:44. 
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Example 14 

This example describes the purification of carboxylesterase protein from fed flea 
midguts. 

About 43,000 cat blood-fed adult flea midguts were collected and prepared as 

5 previously described in Example 1. The extract was then added in 2 aliquots to columns 
containing about 1 to about 2 ml of /?-aminobenzamidine linked agarose beads (available 
from Sigma), equilibrated in 50 mM Tris (pH 8.0), 400 mM NaCl, and incubated 
overnight at 4°C. The columns were then drained to remove unbound protein and the 
two aliquots of unbound protein were combined. The collected unbound protein was 

10 then concentrated and diafiltered into a total volume of about 16 ml of 25 mM Tris (pH 
8), 10 mM NaCl using an Ultrafree-20 10 kD centrifugal concentrator (available from 
Millipore, Bedford, MA). 

Aliquots of about 8 ml were loaded onto an Uno Q6 anion exchange column 
(available from Bio-Rad, Hercules, CA) equilibrated in 25 mM Tris (pH 8), 10 mM 

15 NaCl, operated on a BioLogic liquid chromatography system (available from Bio-Rad). 
The column was washed with 25 mM Tris (pH 8), 10 mM NaCl until all unbound 
protein was removed. Protein bound to the column was then eluted with a linear 
gradient from 10 mM to 1 M NaCl in 25 mM Tris, pH 8. Fractions were assayed for CE 
activity using the assay described previously. The results indicated that CE activity was 

20 eluted at about 220 mM NaCl. 

Fractions containing CE activity were pooled and diafiltered into a total volume 
of about 3 ml of 20 mM MES buffer (2-(N-morpholino)ethanesulfonic acid), pH 6.0, 
containing 10 mM NaCl, in preparation for cation exchange chromatography. The 
sample was then applied to an Uno SI cation exchange column (available from Bio-Rad) 

2j equilibrated in ! 1FS buffer. The column was washed with MES buffer until all unbound 
protein was removed. Protein bound to the column was then eluted with a linear 
gradient from 10 mM to 1 M NaCl in 20 mM MES buffer, pH 6. Fractions were assayed 
for CE activity using the assay described previously. The results indicated that CE 
activity was not retained on the cation exchange column using the above conditions, and 

30 all of the activity was found in the flow-through fractions. 
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Fractions containing CE activity were pooled and diafiltered into a total volume 
of about 3 ml of 25 mM Tris (pH 8), 10 mM NaCl, in preparation for an additional anion 
exchange chromatography step. The sample was then applied to a Bio-Scale Q2 anion 
exchange column (available from Bio-Rad). The column was washed with 25 mM Tris 
5 (pH 8), 10 mM NaCl until all unbound protein was removed. Protein bound to the 

column was then eluted with a linear gradient from 10 mM to 1 M NaCl in 25 mM Tris, 
pH 8. Fractions were assayed for CE activity using the assay described previously. The 
results indicated that CE activity was eluted at about 130 mM NaCl. 

A fraction containing CE activity was diluted into a total volume of about 4 ml of 

10 10 mM phosphate buffer, pH 7.2 containing 10 mM NaCl, in preparation for 

hydroxyapatite chromatography. The sample was then applied to a Bio-Scale CHT2-I 
column (available from Bio-Rad) at a flow rate of about 0.5 ml/min. The column was 
washed with 10 mM phosphate buffer, pH 7.2 containing 10 mM NaCl until all unbound 
protein was removed. Protein bound to the column was then eluted with a linear 

15 gradient from 10 mM phosphate buffer, pH 7.2 containing 10 mM NaCl to 0.5 M 10 
mM phosphate buffer, pH 6.5 containing 10 mM NaCl. Fractions were assayed for CE 
activity using the assay described previously. The results indicated that CE activity was 
eluted at about 200 mM phosphate. 
Example 15 

20 This example describes the purification of a carboxyl esterase protein from 

wandering flea larvae. 

About 120,000 bovine blood-fed adult wandering flea larvae were homogenized 
in 3 batches of about 40,000 wandering larvae in each batch, in Tris buffered saline 
(TBS), pH 8.0 as previously described, except that about 1.2 mg of phenylthiourea was 

25 added to each ml of TBS during the extraction procedure to inhibit cross linkni 0 
reactions. The extracts were dialyzed against 2 changes of about 2 L of 10 mM 
phosphate buffer, pH 7.2 containing 10 mM NaCl in preparation for hydroxyapatite 
batch chromatography. The samples were then filtered through glass Acrodiscs © 
(available from Gelman Sciences, Ann Arbor, MI) and added to 14 g of Macro-Prep 

30 Ceramic Hydroxyapatite, Type I, 40 |im beads (available from Bio-Rad), previously 
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equilibrated in 10 mM phosphate buffer, pH 7.2 containing 10 mM NaCl. The extracts 
and beads were rocked at room temperature for about 30 minutes. Following incubation, 
the beads were centrifuged for about 5 minutes at 500 x g and the supernatants removed. 
The beads were washed with about 40 ml 10 mM phosphate buffer, pH 7.2 containing 
5 10 mM NaCl, centrifuged as above, and washed and centrifuged again to eliminate all 
unbound protein. Bound proteins were eluted by washing the beads with about 40 ml of 
each of 100 mM, 200 mM, 300 mM, and 400 mM phosphate buffer, pH 6.5 containing 
1 0 mM NaCl. Following elution, the supernatants from each concentration of phosphate 
buffer were tested for juvenile hormone esterase activity as described previously in 

10 Example 7. The juvenile hormone esterase activity eluted at different phosphate 

concentrations in each batch, but the activity was generally found in the 200 mM to 300 
mM phosphate fractions. 

The fractions that contained the highest juvenile hormone esterase activity were 
combined and diafiltered into a total volume of about 50 ml of 10 mM phosphate buffer, 

15 pH 7.2 containing 10 mM NaCl using a stirred cell concentrator fitted with a YM10 

ultrafiltration membrane (available from Amicon, Beverly, MA). Aliquots of about 5 ml 
to 10 ml were applied to a chromatography column containing about 10 ml of Macro- 
Prep Ceramic Hydroxyapatite, Type I, 20 [im beads, previously equilibrated with 10 mM 
phosphate buffer, pH 7.2 containing 10 mM NaCl. The column was washed with 10 

20 mM phosphate buffer, pH 7.2 containing 10 mM NaCl until all unbound protein was 
removed. Protein bound to the column was then eluted with a linear gradient from 10 
mM phosphate buffer, pH 7.2 containing 10 % mM NaCl to 0.5 M 10 mM phosphate 
buffer, pH 6.5 containing 10 mM NaCl. Fractions were assayed for carboxylesterase 
activity using the assay described previously. The results indicated that carboxylesterase 

25 activity was eluted at about 160 mM phosphate. 

The fractions that contained the highest carboxylesterase activity were combined 
and diafiltered into a total volume of about 15 ml of 20 mM sodium acetate buffer, pH 
4.0 in preparation for cation exchange chromatography. Aliquots of about 3 ml were 
applied to a PolyCat A cation exchange column (available from PolyLC, Columbia, 

30 MD) equilibrated in 20 mM sodium acetate buffer, pH 6.0, operated on a Waters high 
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performance liquid chromatography system (available from Waters Corporation, 
Milford, MA). The column was washed with 20 mM sodium acetate buffer, pH 6.0 until 
all unbound protein was removed. Protein bound to the column was then eluted with a 
linear gradient from 20 mM sodium acetate buffer, pH 6.0 to 20 mM sodium acetate 
5 buffer, pH 6.0 containing 1 M NaCl. Fractions were assayed for CE activity using the 
assay described previously. The results indicated that there were two pools of CE 
activity. The first pool was not retained on the cation exchange column, and the second 
pool was eluted at about 170 mM NaCl. 

The fractions from the second pool that contained the highest carboxylesterase 

10 activity were combined and diafiltered into a total volume of about 10 ml of 25 mM Tris 
(pH 8), 10 mM NaCl, in preparation for anion exchange chromatography. The sample 
was then applied to a Bio-Scale Q2 anion exchange column (available from Bio-Rad). 
The column was washed with 25 mM Tris (pH 8), 10 mM NaCl until all unbound 
protein was removed. Protein bound to the column was then eluted with a linear 

15 gradient from 10 mM to 1 M NaCl in 25 mM Tris, pH 8. Fractions were assayed for 

carboxylesterase activity using the assay described previously. The results indicated that 
carboxylesterase activity was eluted at about 350 mM NaCl. 

Fractions containing carboxylesterase activity were combined and concentrated 
to about 175 jal using a Centricon 10 centrifugal concentrator (available from Amicon, 

20 Beverly, MA) in preparation for size exclusion chromatography. The sample was 

applied to a Bio-Select SEC 125-5 size exclusion chromatography column (available 
from Bio-Rad), previously equilibrated in TBS, pH 7.2. About 250 jil fractions were 
then collected. The fractions were assayed for carboxylesterase activity using the assay 
described previously. The results indicated that the carboxylesterase activity was eluted 

25 in about 5.5 to u lal of buffer, corresr^n ling to a molecular weight of about 40 to 100 
kDa based on the elution volumes of gel filtration molecular weight standard proteins 
(available from Sigma, St. Louis, MO). 
Example 16 

This example describes the purification of juvenile hormone esterase activity 
30 from unfed adult flea midguts by affinity chromatography. 
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About 16,000 unfed adult flea midguts were collected in 20 mM Tris buffer (pH 
7.7), containing 130 mM NaCl, 1 mM sodium EDTA, 1 mM Pefabloc ® (available from 
Boehringer Mannheim, Indianapolis, IN), 1 microgram/ml (|ig/ml) leupeptin and 1 
Hg/ml pepstatin. The midguts were homogenized by freeze-fracture and sonication, and 
then centrifuged at about 14,000 x g for 20 min. The soluble material from the 
centrifugation step was recovered, diafiltered into Tris buffered saline (TBS), and 
applied to a disposable plastic column containing about 1 ml of 3-[(4 f - 
mercapto)butylthio]-l,l,l-trifluoropropan-2-one linked Sepharose 6B beads, prepared 
similarly to the method described by Venkatesh et al. {J. Biol. Chenu, Vol. 265, No. 35, 
21727-21732, 1990) (the 3-[(4 , -mercapto)butylthio]-l,l,l-trifluoropropan-2-one was a 
gift from Novartis Corp., Basel, Switzerland; and the Epoxy-activated Sepharose 6B is 
available from Pharmacia Biotech Inc., Piscataway, NJ). After overnight incubation at 4 
°C, the column was drained and the beads were washed with about 10 ml TBS, then 
about 10 ml TBS containing 0.1% (w/v) n-octylglucoside (OG; available from 
Boehringer Mannheim). The pre-column, flow-through, and wash fractions were tested 
for juvenile hormone esterase activity by the method previously described above in 
Example 7. The results indicate that the flow-through fraction contained approximately 
40% less juvenile hormone esterase activity than the pre-column material, and that the 
washes contained very little activity. 

Bound protein was eluted from the beads by adding about 10 ml of TBS 
containing 0.1 % (w/v) OG and 1 mM 3-octylthio-l,l,l-trifluoropropan-2-one (OTFP; a 
gift from Novartis Corp.). After a 2 hour incubation at 4°C, about 5 ml of the eluate was 
collected, and the remaining 5 ml was incubated with the beads overnight at 4°C. The 
following day, the beads were drained, the eluate collected, and an additional 10 ml of 
TBS cr nt- lining 0.1 % (w/v) OG and 1 mM OTFP was added to the beads. After an 
overnight incubation at 4°C, the beads were drained and the eluate collected. The final 
10 ml elution step was repeated 3 additional times so that we had 6 eluted fractions. The 
first elution fraction was dialyzed overnight twice against 1 liter of fresh TBS to remove 
excess OTFP. The second elution fraction was also dialyzed overnight against 1 liter of 
fresh TBS to remove OTFP. The third through sixth elution fractions were not dialyzed. 
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All six eluted fractions were tested for juvenile hormone esterase activity by the method 
previously described above in Example 7. The results indicate that only the third elution 
fraction contained detectable juvenile hormone esterase activity. Analysis of the eluted 
fractions by silver-stained SDS-PAGE indicated that several proteins were specifically 
5 bound to the affinity beads and were eluted by OTFP. The apparent molecular weights 
of these proteins, as determined by SDS-PAGE, were about 66 kDa, 55 kDa, and 33 
kDa. About 3.5 ml of each elution fraction were combined and concentrated to about 
110 ii\ using a Centriplus 10 centrifugal concentrator (available from Amicon, Beverly, 
MA). This pool was separated by SDS-PAGE and blotted onto a polyvinylidene 

10 difluoride (PVDF) membrane as described previously in Example 5. The stained protein 
band at about 66 kDa was excised and subjected to N-terminal sequence analysis as 
described previously. 

The results indicated that the N-terminal amino acid sequence of the putative 66 
kDa juvenile hormone esterase protein was DL y/g V k/y/g v/q/n LQGTLKGKE 

15 (denoted herein as SEQ ID NO:74), in which the lower case letters designate 

uncertainties. Below is shown a comparison between different esterase amino acid 
sequences of the present invention. 

SEQ ID NO:74: DL (y/g) V (k/y/g) (v/q/n) LQGTLKGKE 

SEQIDNO:37: DL Q V T L LQGTLKGKE 
20 (Residues 3-17) 

Example 17 

This example describes the purification of an active recombinant juvenile 
hormone esterase protein from baculovirus supernatants. 

About 1 liter of supernatant from cultures of S.frugiperda:pVL-nfE9 um cells 
25 producing the flea esterase protein PfE9 528 was brought to about 50% sanation with 
ammonium sulfate and centrifuged at about 20000 x g for about 30 minutes at 4°C to 
pellet the precipitated material. After centrifugation, the pellet was retained and the 
supernatant was brought to about 100% saturation with ammonium sulfate and 
centrifuged as above. The material in both pellets were resuspended separately in about 
30 35 ml of Tris buffered saline (TBS), pH 8.0. The resuspended pellets were assayed for 
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the presence of flea esterase protein PfE9 528 using standard Western blot techniques and 
a polyclonal antiserum that binds specifically to PfE9 528 protein. Briefly, a rabbit was 
immunized with PHIS-PfE9 528 protein purified from E. coli :pTrc-nfE9 1584 cells 
(described above in Example 12C) and boosted using standard procedures. The results 
indicated that the flea esterase protein PfE9 528 was present in the S. frugiperdaipVL- 
nfE9 1600 supernatants and the protein was precipitated by adjusting the ammonium 
sulfate concentration from about 50% saturation to about 100% saturation. 

The resuspended flea protein PfE9 528 was diafiltered into about 10 ml of 25 mM 
Tris (pH 8.0), 10 mM NaCl using an Ultrafree-20 10 kD centrifugal concentrator in 
preparation for anion exchange chromatography. Aliquots of about 5 ml were loaded 
onto an Uno Q6 anion exchange column equilibrated in 25 mM Tris (pH 8.0), 10 mM 
NaCl. The column was washed with 25 mM Tris (pH 8.0), 10 mM NaCl until most of 
the unbound protein was removed. Protein bound to the column was then eluted with a 
linear gradient from 10 mM to 1 M NaCl in 25 mM Tris buffer (pH 8.0). Fractions were 
assayed for the presence of flea esterase protein PfE9 528 by the immunoblot method 
described above. The results indicated that the flea esterase protein PfE9 528 was eluted at 
about 200 mM NaCl. 

Fractions containing the flea esterase protein PfE9 523 were pooled and 
concentrated to about 440 fil using a Centricon 10 kD centrifugal concentrator in 
preparation for size exclusion chromatography. The sample was applied in 3 aliquots to 
a Bio-Select SEC 125-5 size exclusion chromatography column (available from Bio- 
Rad), previously equilibrated in TBS, pH 7.2. The column was eluted with TBS, pH 7.2 
at a flow rate of about 0.5 ml/min, and fractions of about 250 \x\ were collected. The 
fractions were assayed for the presence of flea esterase protein PfE9 52S by the 
immunoblot method described above, " A he results indicate ' t 1 at the flea esterase protein 
PfE9 528 was eluted with about 6 ml of buffer, corresponding to a molecular weight of 
about 40 to 100 kDa based on the elution volumes of gel filtration molecular weight 
standard proteins (available from Sigma, St. Louis, MO). 

Fractions containing flea esterase protein PfE9 528 were then assayed for juvenile 
hormone esterase activity as described in Example 7 and carboxylesterase activity as 
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described in Example 2. The results indicated that the purified flea esterase protein 
PfE9 528 had both juvenile hormone esterase activity and carboxy lest erase activity. 
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SEQUENCE LISTING 

GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Heska Corporation 

(B) STREET: 1825 Sharp Point Drive 

(C) CITY: Fort Collins 

(D) STATE: CO 

(E) COUNTRY: US 

(F) POSTAL CODE (ZIP) : 80525 

(G) TELEPHONE: (970) 493-7272 

(H) TELEFAX: (970) 484-9505 

(ii) TITLE OF INVENTION: Novel Carboxylesterase Nucleic Acid 

Molecules, Proteins and Uses Thereof 

(iii) NUMBER OF SEQUENCES: 76 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: LAHIVE & COCKFIELD, LLP 

(B) STREET: 2 8 STATE STREET 

(C) CITY: BOSTON 

(D) STATE: MA 

(E) COUNTRY: US 

(F) ZIP : 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: ASCII DOS TEXT 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

( C ) CLASS I F I CATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/747,221 

(B) FILING DATE: November 12, 1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME : Rothenberger , Scott D. 

(B) REGISTRATION NUMBER: 41,277 

(C) REFERENCE/DOCKET NUMBER: HKV-010PC (FC-1-C1 -PCT) 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: ( 6 17 ) 2 27 - 74 0 0 

(B) TELEFAX : (617) 742-4214 

) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 nucleotides 

(B) TYPE: nucleic acid 
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10 



15 



20 



25 



(ii) 
(iii) 

(iv) 

(v) 

(vi) 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: CDNA 
FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 92.. 400 



FEATURE : 

(A) NAME / KEY : 

(B) LOCATION: 

FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 



Xaa 
218 



lie, Thr Lys or Arg 



5 



Xaa = Lys, Glu or Gin 
275, 329 



FEATURE : 

(A) NAME/ KEY : Xaa = Asn, Tyr or Asp 
332 



30 



35 



40 



(B) LOCATION: 

(vii) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TTTACATCAT TAATAAACAT AAATCTAATA AATCTTGTGG ATCAAGATCA 

AGTTTATTAG TGAGAGTGTT GGATTTGTGA AATATTTCAA A ATG AAT 

Met Asn 
1 

TCT TTA ATT GTA AAA ATT TCT CAA GGA GCT ATT GAG GGG AAG 
Ser Leu He Val Lys He Ser Gin Gly Ala He Glu Gly Lys 
5 10 15 

GAA ATG ATT AAT GAT AAT GGA AAG TCG TTT AGA GGA TTT TTG 
Glu Met He Asn Asp Asn Gly Lys Ser Phe Arg Gly Phe Leu 
20 25 30 

GGT ATA CCT TAT GCT AAA CCG CCT ATA GGA AAT CTT ANA TTT 
Gly He Pro Tyr Ala Lys Pro Pro He Gly Asn Leu Xaa Phe 
35 40 

AAG CCT CCT CAA AAG CCT GAT GAT TGG AAT GAT GTT CGA CCA 
Lys Pro Pro Gin Lys Pro Asp Asp Trp Asn Asp Val Arg Pro 
45 50 55 

GCT ACT GAA NAA GCA AAT GGT TGT AGA TCG AAA CAT ATG CTG 
Ala Thr Glu Xaa Ala Asn Gly Cys Arg Ser Lys His Met Leu 
60 65 70 

CAG CAT CAT ATT ATT GGA GAC NAA NAT TGT CTA TAC CTA AAC 
Gin His His He He Gly Asp Xaa Xaa Cys Leu Tyr Leu Asn 
75 80 85 

GTN TAT GTT CCA TTG ACT TCC AAA TTG GAG AAA CTA CCA GTA 
Val Tyr Val Pro Leu Thr Ser Lys Leu Glu Lys Leu Pro Val 
90 95 100 



50 
97 

139 

181 

223 

265 

307 

349 

391 
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401 

ATG TTC TGG G 
Met Phe Trp 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) FEATURE 

10 (A) NAME /KEY : Xaa = He, Thr, Lys or Arg 

(B) LOCATION: 4 3 

(iv) FEATURE 

(A) NAME/KEY: Xaa « Lys, Glu or Gin 

(B) LOCATION: 62, 80 

15 (v) FEATURE 

. ~ (A) NAME/ KEY: Xaa = Asn, Tyr or Asp 

(B) LOCATION: 81 

(vi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Asn Ser Leu He Val Lys He Ser Gin Gly Ala He Glu 
20 1 5 io 

Gly Lys Glu Met He Asn Asp Asn Gly Lys Ser Phe Arg Gly 
15 20 25 

Phe Leu Gly He Pro Tyr Ala Lys Pro Pro He Gly Asn Leu 
30 35 40 

25 Xaa Phe Lys Pro Pro Gin Lys Pro Asp Asp Trp Asn Asp Val 
45 50 55 

Arg Pro Ala Thr Glu Xaa Ala Asn Gly Cys Arg Ser Lys His 
60 65 70 

Met Leu Gin His His He He Gly Asp Xaa Xaa Cys Leu Tyr 
30 75 80 

Leu Asn Val Tyr Val Pro Leu Thr Ser Lys Leu Glu Lys Leu 
85 90 95 

Pro Val Met Phe Trp 
100 

35 (2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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AAA GAG TAT TTT AG 1 TAC ACA GGT GTA CCT TAT GCT AAA CCT 
Lys Glu Tyr Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro 
35 45 50 55 



CCT GTT GGA GAA CTT AGA TTT AAG CCT CCA CAG AAA GCT GAG 
Pro Val Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu 
60 65 

CCA TGG CAA GGT GTT TTC AAC GCC ACA TTA TAC GGA AAT GTG 
40 Pro Trp Gin Gly Val Phe Asn Ala Thr Leu Tyr Gly Asn Val 
70 7 5 80 



50 
100 
150 
200 
250 
300 
350 
400 
401 



(D) OPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CCCAGAACAT TACTGGTAGT TTCTCCAATT TGGAAGTCAA TGGAACATAN 
5 ACGTTTAGGT ATAGACAATN TTNGTCTCCA ATAATATGAT GCTGCAGCAT 
ATGTTTCGAT CTACAACCAT TTGCTTNTTC AGTAGCTGGT CGAACATCAT 
TCCAATCATC AGGCTTTTGA GGAGGCTTAA ATNTAAGATT TCCTATAGGC 
GGTTTAGCAT AAGGTATAC C CAAAAATCCT CTAAACGACT TTCCATTATC 
ATTAATCATT TCCTTCCCCT CAATAGCTCC TTGAGAAATT TTTACAATTA 
10 AAGAATTCAT TTTGAAATAT TTCACAAATC CAACACTCTC ACTAATAAAC 
TTGATCTTGA TCCACAAGAT TTATTAGATT TATGTTTATT AATGATGTAA 
A 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
15 (a) LENGTH: 364 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

20 (iii) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2 . . 364 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

G TCT CGT GTT ATT TTT TTA AGT TGT ATT TTT TTG TTT AGT 
25 Ser Arg Val He Phe Leu Ser Cys He Phe Leu Phe Ser 

1 5 10 

TTT AAT TTT ATA AAC TGT GAT TCC CCG ACT GTA ACT TTG CCC 8 2 

Phe Asn Phe He Asn Cys Asp Ser Pro Thr Val Thr Leu Pro 
15 20 25 

30 CAA GGC GAA TTG GTT GGA AAA GCT TTG ACG AAC GAA AAT GGA 
Gin Gly Glu Leu Val Gly Lys Ala Leu Thr Asn Glu Asn Gly 
30 35 40 



40 



124 



166 



208 



250 
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TGT AAA TCT TTA AAT TTC TTC TTG AAG AAA ATT GAA GGA GAC 2 92 

Cys Lys Ser Leu Asn Phe Phe Leu Lys Lys He Glu Gly Asp 
85 90 95 

GAA GAC TGC TTG GTA GTA AAC GTG TAC GCA CCA AAA ACA ACT 334 
5 Glu Asp Cys Leu Val Val Asn Val Tyr Ala Pro Lys Thr Thr 
100 105 HO 

TCT GAT AAA AAA CTT CCA GTA TTT TTC TGG 3 64 

Ser Asp Lys Lys Leu Pro Val Phe Phe Trp 
115 120 

10 (2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Ser Arg Val lie Phe Leu Ser Cys lie Phe Leu Phe Ser Phe 
1 " 5 10 

Asn Phe lie Asn Cys Asp Ser Pro Thr Val Thr Leu Pro Gin 
20 15 20 ' 25 

Gly Glu Leu Val Gly Lys Ala Leu Thr Asn Glu Asn Gly Lys 

30 35 40 

Glu Tyr Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro Pro 
45 50 55 

25 Val Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu Pro 
60 65 70 

Trp Gin Gly Val Phe Asn Ala Thr Leu Tyr Gly Asn Val Cys 
75 80 

Lys Ser Leu Asn Phe Phe Leu Lys Lys He Glu Gly Asp Glu 
30 85 90 95 

Asp Cys Leu Val Val Asn Val Tyr Ala Pro Lys Thr Thr Ser 
100 105 HO 

Asp Lys Lys Leu Pro Val Phe Phe Trp 
115 120 

35 (2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 364 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCAGAAAAAT ACTGGAAGTT TTTTATCAGA AGTTGTTTTT GGTGCGTACA 5 0 

CGTTTACTAC CAAGCAGTCT TCGTCTCCTT CAATTTTCTT CAAGAAGAAA 100 

5 TTTAAAGATT TACACACATT TCCGTATAAT GTGGCGTTGA AAACACCTTG 15 0 

CCATGGCTCA GCTTTCTGTG GAGGCTTAAA TCTAAGTTCT CCAACAGGAG 20 0 

GTTTAGCATA AGGTACACCT GTGTAGCTAA AATACTCTTT TCCATTTTCG 250 

TTCGTCAAAG CTTTTCCAAC CAATTCGCCT TGGGGCAAAG TTACAGTCGG 300 

GGAATCACAG TTTATAAAAT TAAAACTAAA CAAAAAAATA CAACTTAAAA 350 

10 AAATAACACG AG AC 3 64 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 nucleotides 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

(A) NAME/KEY: CDS 
20 (B) LOCATION: 113.. 421 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

TTTACATTAC ATCAAATCAT ATTTTTATTA GTATATTTTT TAGAAGAACC 50 

TAGC CAAAAA AT ATGG AC TT TAGACTGTGA TTAATTTATT TTACCTGAGA 10 0 

TTTTCCTTTA CA ATG GGT GAT CTT CAA GTG ACT TTG TTA CAA 142 
25 Met Gly Asp Leu Gin Val Thr Leu Leu Gin 

15 10 

GGT TCT TTG AGA GGA AAA GAG CAA ATT AAT GAA AAG GGA AAT 184 
Gly Ser Leu Arg Gly Lys Glu Gin lie Asn Glu Lys Gly Asn 
15 2 0 

30 GTG TTT TAT AGT TAT TCT GGA ATT CCA TAT GCC AAA CCT CCA 22 6 

Val Phe Tyr Ser Tyr Ser Gly He Pro Tyr Ala Lys Pro Pro 
25 30 3 r 

GTT GGT GAT CTA AGA TTC AAG CCA CCT CAA CCT GCA GAA CCT 268 
Val Gly Asp Leu Arg Phe Lys Pro Pro Gin Pro Ala Glu Pro 
35 40 45 50 

TGG TCA GGT GTC CTT GAT GCT ACT AAA GAA GGG AAT AGT TGT 310 
Trp Ser Gly Val Leu Asp Ala Thr Lys Glu Gly Asn Ser Cys 
55 60 65 



40 



AGA TCT GTA CAT TTT ATT AAA AAG ATT AAA GTA GGG GCT GAA 
Arg Ser Val His Phe He Lys Lys He Lys Val Gly Ala Glu 
70 75 80 



352 
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GAT TGT CTA TAC CTC AAT GTC TAT GTA CCA AAA ACA TCA GAG 3 94 

Asp Cys Leu Tyr Leu Asn Val Tyr Val Pro Lys Thr Ser Glu 
85 90 

AAA TCC CTT CTT CCA GTA ATG GTA TGG 421 
5 Lys Ser Leu Leu Pro Val Met Val Trp 
95 100 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Gly Asp Leu Gin Val Thr Leu Leu Gin Gly Ser Leu Arg 
15 1 5 10 

Gly Lys Glu Gin He Asn Glu Lys Gly Asn Val Phe Tyr Ser 
15 2 0 25 

Tyr Ser Gly He Pro Tyr Ala Lys Pro Pro Val Gly Asp Leu 
^30 35 40 

20 Arg Phe Lys Pro Pro Gin Pro Ala Glu Pro Trp Ser Gly Val 
45 50 55 

Leu Asp Ala Thr Lys Glu Gly Asn Ser Cys Arg Ser Val His 
60 65 70 

Phe He Lys Lys He Lys Val Gly Ala Glu Asp Cys Leu Tyr 
25 75 80 

Leu Asn Val Tyr Val Pro Lys Thr Ser Glu Lys Ser Leu Leu 
85 90 95 

Pro Val Met Val Trp 
100 

30 (2) INFORMATION Fm SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (d) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

CCATACCATT ACTGGAAGAA GGGATTTCTC TGATGTTTTT GGTACATAGA 50 
CATTGAGGTA TAGACAATCT TCAGCCCCTA CTTTAATCTT TTTAATAAAA 100 



* \ 
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TGTACAGATC TACAACTATT CCCTTCTTTA GTAGCATCAA GGACACCTGA 150 

CCAAGGTTCT GCAGGTTGAG GTGGCTTGAA TCTTAGATCA CCAACTGGAG 2 00 

GTTTGGCATA TGGAATTCCA GAATAACTAT AAAACACATT TCCCTTTTCA 250 

TTAATTTGCT CTTTTCCTCT CAAAGAACCT TGTAACAAAG TCACTTGAAG 3 00 

5' ATCACCCATT GTAAAGGAAA ATCTCAGGTA AAATAAATTA ATCACAGTCT 3 50 

AAAGTCCATA TTTTTTGGCT AGGTTCTTCT AAAAAATATA CTAATAAAAA 4 00 

TATGATTTGA TGTAATGTAA A 42 1 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

10 (A) LENGTH: 524 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

15 (iii) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 113 .. 523 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAACGTTGAT ACGATAGACA TGTCGTCTTC AAAACGTCTA TTTTATCATA 5 0 

20 AACAAAACGA GATAAATAAT AACAATTAAG CAACCAAAAT GCATTAAAAA 100 

ACACAATAAA AA ATG TTA CCT CAC AGT AGT GCA TTA GTT TTA 14 2 

Met Leu Pro His Ser Ser Ala Leu Val Leu 
15 10 

TTT TTA TTT TTT TTA TTT TTC TTA TTT ACA CCT ATC TTG TGC 184 
25 Phe Leu Phe Phe Leu Phe Phe Leu Phe Thr Pro lie Leu Cys 

15 20 

ATA CTA TGG GAT AAC CTA GAT CAG CAT TTG TGC AGA GTT CAA 22 6 
lie Leu Trp Asp Asn Leu Asp Gin His Leu Cys Arg Val Gin 
25 30 35 

30 TTT AAC AGG ATC ACG GAA GGA AAA CCG TTC CGA TAT AAA GAT 268 
Phe Asn Arg He Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp 
40 45 50 

CAT AGG W GAT GTA TAT TGT TCT TAT TTG GGA ATT CCT TAT 310 
His Arg Asn Asp Val Tyr Cys Ser Tyr Leu Gly He Pro Tyr 
35 55 60 65 

GCC GAA CCG CCT ATT GGA CCA TTA CGA TTT CAG TCT CCA AAA 3 52 

Ala Glu Pro Pro He Gly Pro Leu Arg Phe Gin Ser Pro Lys 
70 75 80 

CCA ATA TCA AAT CCA AAA ACA GGA TTC GTA CAG GCT CGA ACT 3 94 

40 Pro He Ser Asn Pro Lys Thr Gly Phe Val Gin Ala Arg Thr 

85 90 
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TTG GGA GAC AAA TGT TTC CAG GAA AGT CTA ATA TAT TCT TAT 436 
Leu Gly Asp Lys Cys Phe Gin Glu Ser Leu lie Tyr Ser Tyr 
95 * 100 105 

GCA GGA AGC GAA GAT TGC TTA TAT CTG AAT ATA TTC ACG CCA 478 
5 Ala Gly Ser Glu Asp Cys Leu Tyr Leu Asn He Phe Thr Pro 
110 H5 120 

GAG ACT GTT AAT TCT GCG AAC AAT ACA AAA TAT CCT GTA ATG 52 0 

Glu Thr Val Asn Ser Ala Asn Asn Thr Lys Tyr Pro Val Met 
125 130 135 

10 TTC T 524 
Phe 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

]5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Leu Pro His Ser Ser Ala Leu Val Leu Phe Leu Phe Phe 
20 1 5 10 

Leu Phe Phe Leu Phe Thr Pro He Leu Cys He Leu Trp Asp 
15 20 25 

Asn Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn Arg He 
30 35 40 

25 Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Arg Asn Asp 
45 50 55 

Val Tyr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu Pro Pro 
60 65 70 

He Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro He Ser Asn 
30 75 80 

Pro Lys Thr Gly Phe Val Gin Ala Arg Thr Leu Gly Asp Lys 
85 90 95 

Cys Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly Ser Glu 
100 105 HO 

35 Asp Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr Val Asn 
115 120 125 

Ser Ala Asn Asn Thr Lys Tyr Pro Val Met Phe 
130 135 



(2) INFORMATION FOR SEQ ID NO:12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

AGAACATTAC AGGATATTTT GTATTGTTCG CAGAATTAAC AGTCTCTGGC 50 

GTGAATATAT TCAGATATAA GCAATCTTCG CTTCCTGCAT AAGAATATAT 10 0 

10 TAG AC TTTCC TGGAAACATT TGTCTCCCAA AGTTCGAGCC TGTACGAATC 15 0 

CTGTTTTTGG ATTTGATATT GGTTTTGGAG ACTGAAATCG TAATGGTCCA 2 00 

ATAGGCGGTT CGGCATAAGG AATTCCCAAA TAAGAACAAT ATACATCATT 2 50 

CCTATGATCT TTATATCGGA ACGGTTTTCC TTCCGTGATC CTGTTAAATT 30 0 

GAACTCTGCA C AAATG CTG A TCTAGGTTAT CCCATAGTAT GCACAAGATA 3 50 

15 GGTGTAAATA AGAAAAATAA AAAAAATAAA AATAAAACTA ATGCACTACT 400 

GTGAGGTAAC ATTTTTTATT GTGTTTTTTA ATGCATTTTG GTTGCTTAAT 45 0 

TGTTATTATT TATCTCGTTT TGTTTATGAT AAAATAGACG TTTTGAAGAC 500 

GACATGTCTA TCGTATCAAC GTTC 524 

(2) INFORMATION FOR SEQ ID NO: 13: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1982 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 31.. 1517 

(iv) FEATURE: 

30 (A) NAME/KEY: Asx = Asn or Asp 

(B) LOCATION: 300 



(v) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AT TTT AGC TAC ACA GGT GTA CCT TAT C CT AAA CCT CCT GTT 41 
Phe Ser Tyr Thr Gly Val Pro Tyr /\la Lys Pro Pro Vr 1 
35 1 5 10 

GGA GAA CTT AGA TTT AAG CCT CCA CAG AAA GCT GAG CCA TGG 8 3 

Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu Pro Trp 
15 20 25 

CAA GGT GTT TTC AAC GCC ACA TTA TAC GGA AAT GTG TGT AAA 12 5 

40 Gin Gly Val Phe Asn Ala Thr Leu Tyr Gly Asn Val Cys Lys 
30 35 40 
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TCT TTA AAT TTC TTC TTG AAG AAA ATT GAA GGA GAC GAA GAC 
Ser Leu Asn Phe Phe Leu Lys Lys lie Glu Gly Asp Glu Asp 
45 50 55 

TGC TTG GTA GTA AAC GTG TAC GCA CCA AAA ACA ACT TCT GAT 
Cys Leu Val Val Asn Val Tyr Ala Pro Lys Thr Thr Ser Asp 
60 65 



167 



209 



AAA AAA CTT CCA GTA TTT TTC TGG GTT CAT GGT GGT GGT TTT 
Lys Lys Leu Pro Val Phe Phe Trp Val His Gly Gly Gly Phe 
70 75 .80 

10 GTG ACT GGA TCC GGA AAT TTA GAA TTC CAA AGC CCA GAT TAT 
Val Thr Gly Ser Gly Asn Leu Glu Phe Gin Ser Pro Asp Tyr 
8 5 90 95 

TTA GTA RAT TTT GAT GTT ATT TTC GTA ACT TTC AAT TAC CGA 
Leu Val Asx Phe Asp Val He Phe Val Thr Phe Asn Tyr Arg 
15 100 105 no 

TTG GGA CCT CTC GGA TTT CTG AAT TTG GAG TTG GAG GGT GCT 
Leu Gly Pro Leu Gly Phe Leu Asn Leu Glu Leu Glu Gly Ala 
115 120 125 

CCA GGA AAT GTA GGA TTA TTG GAT CAG GTG GCA GCT CTG AAA 
20 Pro Gly Asn Val Gly Leu Leu Asp Gin Val Ala Ala Leu Lys 

130 135 



251 



293 



335 



377 



419 



TGG ACC AAA GAA AAC ATT GAG AAA TTT GGT GGA GAT CCA GAA 
Trp Thr Lys Glu Asn He Glu Lys Phe Gly Gly Asp Pro Glu 
140 145 150 

25 AAT ATT ACA ATT GGT GGT GTT TCT GCT GGT GGA GCA AGT GTT 
Asn He Thr He Gly Gly Val Ser Ala Gly Gly Ala Ser Val 
155 160 165 

CAT TAT CTT TTG TTA TCT CAT ACA ACC ACT GGA CTT TAC AAA 
His Tyr Leu Leu Leu Ser His Thr Thr Thr Gly Leu Tyr Lys 
30 170 175 180 

AGG GCA ATT GCT CAA AGT GGA AGT GCT TTT AAT CCA TGG GCC 
Arg Ala He Ala Gin Ser Gly Ser Ala Phe Asn Pro Trp Ala 
185 190 195 

TTC CAA Avi?. CAT CCA GTA APG JGT AGT CTT CAA CTT GCT GAG 
35 Phe Gin Arg His Pro Val Lys Arg Ser Leu Gin Leu Ala Glu 

200 205 



461 



503 



545 



587 



629 



ATA TTG GGT CAT CCC ACA AAC AAT ACT CAA GAT GCT TTA GAA 
He Leu Gly His Pro Thr Asn Asn Thr Gin Asp Ala Leu Glu 
210 215 220 



40 TTC TTA CAA AAA GCC CCC GTA GAC AGT CTC CTG AAG AAA 
Phe Leu Gin Lys Ala Pro Val Asp Ser Leu Leu Lys Lys 
225 230 235 



v 



lTG 
Met 



671 



713 



3 
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CCA GCT GAA ACA GAA GOT GAA ATA ATA GAA GAG TTT GTC TTC 7 55 

Pro Ala Glu Thr Glu Gly Glu He He Glu Glu Phe Val Phe 
240 245 250 

GTA CCA TCA ATT GAA AAA GTT TTC CCA TCC CAC CAA CCT TTC 797 
5 Val Pro Ser He Glu Lys Val Phe Pro Ser His Gin Pro Phe 
255 260 265 

TTG GAA GAA TCA CCA TTG GCC AGA ATG AAA TCC GGA TCC TTT 83 9 

Leu Glu Glu Ser Pro Leu Ala Arg Met Lys Ser Gly Ser Phe 
270 275 
10 AAC AAA GTA CCT TTA TTA GTT GGA TTT AAC AGT GCA GAA GGA 881 
Asn Lys Val Pro Leu Leu Val Gly Phe Asn Ser Ala Glu Gly 
280 285 290 

CTT TTG TTC AAA TTC TTC ATG AAA GAA AAA CCA GAG ATG CTG 923 
Leu Leu Phe Lys Phe Phe Met Lys Glu Lys Pro Glu Met Leu 
15 295 300 305 

AAC CAA GCT GAA GCA GAT TTT GAA AGA CTC GTA CCA GCC GAA 965 
Asn Gin Ala Glu Ala Asp Phe Glu Arg Leu Val Pro Ala Glu 
310 315 320 

TTT GAA TTA GTC CAT GGA TCA GAG GAA TCG AAA AAA CTT GCA 1007 
20 Phe Glu Leu Val His Gly Ser Glu Glu Ser Lys Lys Leu Ala 
325 330 335 

GAA AAA ATC AGG AAG TTT TAC TTT GAC GAT AAA CCC GTT CCA 104 9 

Glu Lys He Arg Lys Phe Tyr Phe Asp Asp Lys Pro Val Pro 
340 345 

25 GAA AAT GAA CAG AAA TTT ATT GAC TTG ATA GGA GAT ATT TGG 10 91 

Glu Asn Glu Gin Lys Phe He Asp Leu He Gly Asp He Trp 
350 355 360 

TTT ACT AGA GGT GTT GAC AAG CAT GTC AAG TTG TCT GTG GAG 113 3 

Phe Thr Arg Gly Val Asp Lys His Val Lys Leu Ser Val Glu 
30 365 ~ 370 375 

AAA CAA GAC GAA CCA GTT TAT TAT TAT GAA TAT TCC TTC TCG 117 5 

Lys Gin Asp Glu Pro Val Tyr Tyr Tyr Glu Tyr Ser Phe Ser 
380 385 390 

GAA AGT CAT CCT GCA AAA GGA ACA TTT GGT GAT CAT AAT CTG 1217 
35 Glu Ser His Pro Ala Lys Gly Thr Phe Gly Asp His Asn Leu 
395 400 405 

ACT GGT GCA TGC CAT GGA GAA GAA CTT GTG AAT TTA TTC AAA 12 5 9 

Thr Gly Ala Cys His Gly Glu Glu Leu Val Asn Leu Phe Lys 
410 415 

40 GTC GAG ATG ATG AAG CTG GAA AAA GAT AAA CCT AAT GTT CTA 13 01 

Val Glu Met Met Lys Leu Glu Lys Asp Lys Pro Asn Val Leu 
420 425 430 
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TTA ACA AAA GAT AGA GTA CTT GCC ATG TGG ACT AAC TTC ATC 1343 
Leu Thr Lys Asp Arg Val Leu Ala Met Trp Thr Asn Phe lie 
435 ^ 440 445 

AAA AAT GGA AAT CCT ACT CCT GAA GTA ACA GAA TTA TTG CCA 13 85 

5 Lys Asn Gly Asn Pro Thr Pro Glu Val Thr Glu Leu Leu Pro 
450 455 460 

GTT AAA TGG GAA CCT GCC ACA AAA GAC AAG TTG AAT TAT TTG 14 2 7 

Val Lys Trp Glu Pro Ala Thr Lys Asp Lys Leu Asn Tyr Leu 
465 470 475 

10 AAC ATT GAT GCC ACC TTA ACT TTG GGA ACA AAT CCT GAG GCA 14 6 9 

Asn lie Asp Ala Thr Leu Thr Leu Gly Thr Asn Pro Glu Ala 
480 485 

AAC CGA GTC AAA TTT TGG GAA GAC GCC ACA AAA TCT TTG CAC 1511 
Asn Arg Val Lys Phe Trp Glu Asp Ala Thr Lys Ser Leu His 
15 490 ~ 495 500 

GGT CAA TAA TAATTTATGA AAATTGTTTT AAATACTTTA GGTAATATAT 1560 
Gly Gin 

TAGGTAAATA AAAATTAAAA AATAACAATT TTTATGTTTT ATGTATTGGC 1610 

TTATGTGTAT CAGTTCTAAT TTTATTTATT TATTCTTGTT TTGCTTGTTT 16 60 

20 TGAAATATCA TGGTTTTAAT TTTCAAAACA CAACGTCGTT TGTTTTTAGC 1710 

AAAATTTC C A ATAGATATGT TATATTAAGT ACTCTGAAGT ATTTTTATAT 17 60 

ATACACTAAA ATCAGTAAAA ATACATTAAC TAAAAATATA AG AT ATTTT C 1810 

AATAATTTTT TTTAAAGAAA ATACCAAAAA TAAAGTAAAA TTC C AAACGG 1860 

AATTTTTGTT TAACTTAAAA ATAAAATTAA CTCTTCAATA ATTTTGATAA 1910 

25 TTAGTATTTC TGATATCATT AGTGAAAATT ATATTTTGAT AATACGTATT 196 0 

TATATTTAAA ATAAAATTAT GT 1982 

(2) INFORMATION FOR SEQ ID NO : 14 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 5 amino acids 

30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro Pro Val G .y 
35 i 5 io 

Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu Pro Trp Gin 
15 20 25 

Gly Val Phe Asn Ala Thr Leu Tyr Gly Asn Val Cys Lys Ser 
30 35 40 

40 Leu Asn Phe Phe Leu Lys Lys He Glu Gly Asp Glu Asp Cys 
45 50 55 
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Leu Val Val Asn Val Tyr Ala Pro Lys Thr Thr Ser Asp Lys 
60 65 70 

Lys Leu Pro Val Phe Phe Trp Val His Gly Gly Gly Phe Val 
75 80 

5 Thr Gly Ser Gly Asn Leu Glu Phe Gin Ser Pro Asp Tyr Leu 
85 90 95 

Val Asx Phe Asp Val He Phe Val Thr Phe Asn Tyr Arg Leu 
100 105 HO 

Gly Pro Leu Gly Phe Leu Asn Leu Glu Leu Glu Gly Ala Pro 
10 115 120 125 

Gly Asn Val Gly Leu Leu Asp Gin Val Ala Ala Leu Lys Trp 
130 135 140 

Thr Lys Glu Asn He Glu Lys Phe Gly Gly Asp Pro Glu Asn 
145 150 

15 He Thr He Gly Gly Val Ser Ala Gly Gly Ala Ser Val His 
155 160 165 

Tyr Leu Leu Leu Ser His Thr Thr Thr Gly Leu Tyr Lys Arg 
170 175 180 

Ala He Ala Gin Ser Gly Ser Ala Phe Asn Pro Trp Ala Phe 
20 185 190 195 

Gin Arg His Pro Val Lys Arg Ser Leu Gin Leu Ala Glu He 
200 205 210 

Leu Gly His Pro Thr Asn Asn Thr Gin Asp Ala Leu Glu Phe 
215 220 

25 Leu Gin Lys Ala Pro Val Asp Ser Leu Leu Lys Lys Met Pro 
225 230 235 

Ala Glu Thr Glu Gly Glu He He Glu Glu Phe Val Phe Val 
240 245 250 

Pro Ser He Glu Lys Val Phe Pro Ser His Gin Pro Leu Leu 
30 255 T60 265 

Glu Glu Ser Pro Leu Ala Arg Met Lys Ser Gly ber Phe Asn 
270 275 280 

Lys Val Pro Leu Leu Val Gly Phe Asn Ser Ala Glu Gly Leu 
285 290 

35 Leu Phe Lys Phe Phe Met Lys Glu Lys Pro Glu Met Leu Asn 
295 300 305 



Gin Ala Glu Ala Asp Phe Glu Arg Leu Val Pro Ala Glu Phe 
310 315 320 
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Glu Leu Val His 
325 

Lys lie Arg Lys 
340 

5 Asn Glu Gin Lys 



Thr Arg Gly Val 
365 

Gin Asp Glu Pro 
10 380 

Ser His Pro Ala 
395 

Gly Ala Cys His 
410 

15 Glu Met Met Lys 



Gly Ser Glu Glu Ser 
330 

Phe Tyr Phe Asp Asp 
345 

Phe lie Asp Leu lie 
355 

Asp Lys His Val Lys 
370 

Val Tyr Tyr Tyr Glu 
385 

Lys Gly Thr Phe Gly 
400 

Gly Glu Glu Leu Val 
415 

Leu Glu Lys Asp Lys 
425 



Lys Lys Leu Ala Glu 
335 

Lys Pro Val Pro Glu 
350 

Gly Asp lie Trp Phe 
360 

Leu Ser Val Glu Lys 
375 

Tyr Ser Phe Ser Glu 
390 

Asp His Asn Leu Thr 
405 

Asn Leu Phe Lys Val 
420 

Pro Asn Val Leu Leu 
430 



Thr Lys Asp Arg Val Leu Ala 
435 440 

Asn Gly Asn Pro Thr Pro Glu 
20 450 455 

Lys Trp Glu Pro Ala Thr Lys 
465 

He Asp Ala Thr Leu Thr Leu 
480 

25 

Arg Val Lys Phe Trp Glu Asp 
495 



Met Trp Thr Asn Phe He Lys 
445 

Val Thr Glu Leu Leu Pro Val 
460 

Asp Lys Leu Asn Tyr Leu Asn 
470 475 

Gly Thr Asn Pro Glu Ala Asn 
485 490 

Ala Thr Lys Ser Leu His Gly 
500 



Gin 
505 



30 (2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1982 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO : 15 : 

ACATAATTTT ATTTTAAATA TAAATACGTA TTATCAAAAT ATAATTTTCA 
CTAATGATAT CAGAAATACT AATTATCAAA ATTATTGAAG AGTTAATTTT 
40 ATTTTTAAGT TAAACAAAAA TTCCGTTTGG AATTTTACTT TATTTTTGGT 



-112- FC-1-C1-PCT 

ATTTTCTTTA AAAAAAATTA TTGAAAATAT CTTATATTTT TAGTTAATGT 200 

ATTTTTACTG ATTTTAGTGT ATATATAAAA ATACTTCAGA GTACTTAATA 250 

TAACATATCT ATTGGAAATT TTGCTAAAAA CAAACGACGT TGTGTTTTGA 3 00 

AAATTAAAAC CATGATATTT CAAAACAAGC AAAACAAGAA TAAATAAATA 3 50 

5 AAATTAGAAC TGATACACAT AAGCCAATAC ATAAAACATA AAAATTGTTA 4 00 

TTTTTTAATT TTTATTTACC TAATATATTA CCTAAAGTAT TTAAAACAAT 450 

TTTCATAAAT TATTATTGAC CGTGCAAAGA TTTTGTGGCG TCTTCCCAAA 500 

ATTTGACTCG GTTTGCCTCA GGATTTGTTC CCAAAGTTAA GGTGGCATCA 550 

ATGTTCAAAT AATTCAACTT GTCTTTTGTG GCAGGTTCCC ATTTAACTGG 600 

10 CAATAATTCT GTTACTTCAG GAGTAGGATT TCCATTTTTG ATGAAGTTAG 650 

TCCACATGGC AAGTACTCTA TCTTTTGTTA ATAGAACATT AGGTTTATCT 70 0 

TTTTCCAGCT TCATCATCTC GACTTTGAAT AAATTCACAA GTTCTTCTCC 75 0 

ATGGCATGCA C CAGTCAG AT TATGATCACC AAATGTTCCT TTTGCAGGAT 800 

GACTTTCCGA GAAGGAATAT TCATAATAAT AAACTGGTTC GTCTTGTTTC 85 0 

15 TCCACAGACA ACTTGACATG CTTGTCAACA CCTCTAGTAA ACCAAATATC 900 

TCCTATCAAG TCAATAAATT TCTGTTCATT TTCTGGAACG GGTTTATCGT 95 0 

CAAAGTAAAA CTTCCTGATT TTTTCTGCAA GTTTTTTCGA TTCCTCTGAT 10 00 

CCATGGACTA ATTCAAATTC GGCTGGTACG AGTCTTTCAA AATCTGCTTC 105 0 

AG CTTGGTTC AGCATCTCTG GTTTTTCTTT CATGAAGAAT TTGAACAAAA 1100 

20 GTCCTTCTGC ACTGTTAAAT CCAACTAATA AAGGTACTTT GTTAAAGGAT 115 0 

CCGGATTTCA TTCTGGCCAA TGGTGATTCT TCCAAGAAAG GTTGGTGGGA 12 00 

TGGGAAAACT TTTTCAATTG ATGGTACGAA GACAAACTCT TCTATTATTT 12 5 0 

CACCTTCTGT TTCAGCTGGC ATTTTCTTCA GGAGACTGTC TACGGGGGCT 13 0 0 

TTTTGTAAGA ATTCTAAAGC ATCTTGAGTA TTGTTTGTGG GATGACCCAA 13 50 

25 TATCTCAGCA AGTTGAAGAC TACGCTTTAC TGGATGTCTT TGGAAGGCCC 14 0 0 

ATGGATTAAA AGCACTTCCA CTTTGAGCAA TTGCCCTTTT GTAAAGTCCA 14 50 

GTGGTTGTAT GAGATAACAA AAGATAATGA ACACTTGCTC CACCAGCAGA 15 00 

AACACCACCA ATTGTAATAT TTTCTGGATC TCCACCAAAT TTCTCAATGT 1550 

TTTCTTTGGT C C ATTTC AG A GCTGCCACCT GATCCAATAA TCCTACATTT 16 0 0 

30 CCTGGAGCAC CCTCCAACTC CAAATTCAGA AATCCGAGAG GTCCCAATCG 16 5 0 

GTAATTGAAA GTTACGAAAA TAACATCAAA ATYTACTAAA TAATCTGGGC 17 00 

TTTGGAATTC TAAATTTCCG GATCCAGTCA CAAAACCACC ACCATGAACC 17 50 

CAGAAAAATA CTGGAAGTTT TTTATCAGAA GTTGTTTTTG GTGCGTACAC 18 00 

GTTTACTACC AAGCAGTCTT CGTCTCGTTC AATTTTCTTC AAGAAGAAAT 18 5 0 

35 TTAAAGATTT ACACACATTT CCGTATAATG TGGCGTTGAA AACACCTTGC 19 00 

CATGGCTCAG CTTTCTGTGG AGGCTTAAAT CTAAGTTCTC CAACAGGAGG 19 50 

TTTAGCATAA GGTACACCTG TGTAGCTAAA AT 19 82 

(2) INFORMATION FOR SEQ ID NO : 16 : 

(i) SEQUENCE CHARACTERISTICS: 

40 (A) LENGTH: 1515 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 (iii) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1515 

(iv) FEATURE: 

(A) NAME /KEY : Asx = Asn or Asp 
50 (B) LOCATION: 2 98 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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TTT AGC TAC ACA GGT GTA CCT TAT GCT AAA CCT CCT GTT 3 9 

Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro Pro Val 
15 10 

GGA GAA CTT AGA TTT AAG CCT CCA CAG AAA GCT GAG CCA TGG 81 
5 Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu Pro Trp 
15 2 0 2 5 

CAA GGT GTT TTC AAC GCC ACA TTA TAC GGA AAT GTG TGT AAA 123 
Gin Gly Val Phe Asn Ala Thr Leu Tyr Gly Asn Val Cys Lys 
30 35 40 

10 TCT TTA AAT TTC TTC TTG AAG AAA ATT GAA GGA GAC GAA GAC 165 
Ser Leu Asn Phe Phe Leu Lys Lys lie Glu Gly Asp Glu Asp 
45 50 55 

TGC TTG GTA GTA AAC GTG TAC GCA CCA AAA ACA ACT TCT GAT 2 07 

Cys Leu Val Val Asn Val Tyr Ala Pro Lys Thr Thr Ser Asp 
15 60 65 

AAA AAA CTT CCA GTA TTT TTC TGG GTT CAT GGT GGT GGT TTT 24 9 

Lys Lys Leu Pro Val Phe Phe Trp Val His Gly Gly Gly Phe 
70 75 80 

GTG ACT GGA TCC GGA AAT TTA GAA TTC CAA AGC CCA GAT TAT 2 91 

20 Val Thr Gly Ser Gly Asn Leu Glu Phe Gin Ser Pro Asp Tyr 
85 90 95 

TTA GTA RAT TTT GAT GTT ATT TTC GTA ACT TTC AAT TAC CGA 333 
Leu Val Asx Phe Asp Val lie Phe Val Thr Phe Asn Tyr Arg 
100 105 110 

25 TTG GGA CCT CTC GGA TTT CTG AAT TTG GAG TTG GAG GGT GCT 375 
Leu Gly Pro Leu Gly Phe Leu Asn Leu Glu Leu Glu Gly Ala 
115 120 125 

CCA GGA AAT GTA GGA TTA TTG GAT CAG GTG GCA GCT CTG AAA 417 
Pro Gly Asn Val Gly Leu Leu Asp Gin Val Ala Ala Leu Lys 
30 130 135 

TGG ACC AAA GAA AAC ATT GAG AAA TTT GGT GGA GAT CCA GAA 459 
Trp Thr Lys Glu Asn lie Glu Lys Phe Gly Gly Asp Pro Glu 
140 145 150 

AAT ATT ACA ATT GGT GGT GTT TCT GCT GGT GGA JC ^ AGT GTT 501 
35 Asn lie Thr He Gly Gly Val Ser Ala Gly Gly Ala Ser Val 
155 160 165 

CAT TAT CTT TTG TTA TCT CAT ACA ACC ACT GGA CTT TAC AAA 54 3 

His Tyr Leu Leu Leu Ser His Thr Thr Thr Gly Leu Tyr Lys 
170 175 180 

40 AGG GCA ATT GCT CAA AGT GGA AGT GCT TTT AAT CCA TGG GCC 5 85 

Arg Ala He Ala Gin Ser Gly Ser Ala Phe Asn Pro Trp Ala 
185 190 195 



-114- 



TTC CAA 
Phe Gin 



ATA TTG 
5 lie Leu 
210 

TTC TTA 
Phe Leu 
225 

10 CCA GCT 
Pro Ala 



GTA CCA 
Val Pro 

15 

TTG GAA 
Leu Glu 



AAC AAA 
20 Asn Lys 
280 

CTT TTG 
Leu Leu 
295 

25 AAC CAA 
Asn Gin 



TTT GAA 
Phe Glu 

30 

GAA AAA 
Glu Lys 

GAA AAT 
35 Glu Asn 
350 

TTT ACT 
Phe Thr 
365 

40 AAA CAA 
Lys Gin 



AGA CAT CCA 
Arg His Pro 
200 

GGT CAT CCC 
Gly His Pro 



CAA AAA GCC 
Gin Lys Ala 



GAA ACA GAA 
Glu Thr Glu 
240 

TCA ATT GAA 
Ser lie Glu 
255 

GAA TCA CCA 
Glu Ser Pro 
270 

GTA CCT TTA 
Val Pro Leu 



TTC AAA TTC 
Phe Lys Phe 



GCT GAA GCA 
Ala Glu Ala 
310 

TTA GTC CAT 
Leu Val His 
325 

ATC AGG AAG 
lie Arg Lys 
340 

GAA CAG AAA 
Glu Gin Lys 



AGA GGT GTT 
Arg Gly Val 



GAC GAA CCA 
Asp Glu Pro 
38Q 



GTA AAG CGT 
Val Lys Arg 



ACA AAC AAT 
Thr Asn Asn 
215 

CCC GTA GAC 
Pro Val Asp 
230 

GGT GAA ATA 
Gly Glu lie 
245 

AAA GTT TTC 
Lys Val Phe 



TTG GCC AGA 
Leu Ala Arg 



TTA GTT GGA 
Leu Val Gly 
285 

TTC ATG AAA 
Phe Met Lys 
300 

GAT TTT GAA 
Asp Phe Glu 
315 

GGA TCA GAG 
Gly Ser Glu 



TTT TAC TTT 
Phe Tyr Phe 

TTT ATT GAC 
F.ie lie Asp 
355 

GAC AAG CAT 
Asp Lys His 
370 

GTT TAT TAT 
Val Tyr Tyr 
385 



AGT CTT CAA 
Ser Leu Gin 
205 

ACT CAA GAT 
Thr Gin Asp 
220 

AGT CTC CTG 
Ser Leu Leu 



ATA GAA GAG 
He Glu Glu 



CCA TCC CAC 
Pro Ser His 
260 

ATG AAA TCC 
Met Lys Ser 
275 

TTT AAC AGT 
Phe Asn Ser 
290 

GAA AAA CCA 
Glu Lys Pro 



AGA CTC GTA 
Arg Leu Val 



GAA TCG AAA 
Glu Ser Lys 
330 

GAC GAT AAA 
Asp Asp Lys 
345 

TTG ATA GGA 
Leu II » Gly 
360 

GTC AAG TTG 
Val Lys Leu 



TAT GAA TAT 
Tyr Glu Tyr 



CTT GCT GAG 
Leu Ala Glu 



GCT TTA GAA 
Ala Leu Glu 



AAG AAA ATG 
Lys Lys Met 
235 

TTT GTC TTC 
Phe Val Phe 
250 

CAA CCT TTC 
Gin Pro Leu 
265 

GGA TCC TTT 
Gly Ser Phe 



GCA GAA GGA 
Ala Glu Gly 



GAG ATG CTG 
Glu Met Leu 
305 

CCA GCC GAA 
Pro Ala Glu 
320 

AAA CTT GCA 
Lys Leu Ala 
335 

CCC GTT CCA 
Pro Val Pro 

GAT ATT TGG 
Asp He Trp 



TCT GTG GAG 
Ser Val Glu 
375 

TCC TTC TCG 
Ser Phe Ser 
390 



FC-1-C1-PCT 

627 

669 

711 

753 

795 

837 

879 

921 

963 

1005 

1047 
1089 

1131 

1173 
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GAA AGT CAT CCT GCA AAA GGA ACA TTT GGT GAT CAT AAT CTG 1215 
Glu Ser His Pro Ala Lys Gly Thr Phe Gly Asp His Asn Leu 
395 400 405 

ACT GGT GCA TGC CAT GGA GAA GAA CTT GTG AAT TTA TTC AAA 12 57 

5 Thr Gly Ala Cys His Gly Glu Glu Leu Val Asn Leu Phe Lys 

410 415 

GTC GAG ATG ATG AAG CTG GAA AAA GAT AAA CCT AAT GTT CTA 12 99 

Val Glu Met Met Lys Leu Glu Lys Asp Lys Pro Asn Val Leu 
420 425 430 

10 TTA ACA AAA GAT AGA GTA CTT GCC ATG TGG ACT AAC TTC ATC 1341 
Leu Thr Lys Asp Arg Val Leu Ala Met Trp Thr Asn Phe lie 
435 440 445 

AAA AAT GGA AAT CCT ACT CCT GAA GTA ACA GAA TTA TTG CCA 13 83 

Lys Asn Gly Asn Pro Thr Pro Glu Val Thr Glu Leu Leu Pro 
15 450 455 460 

GTT AAA TGG GAA CCT GCC ACA AAA GAC AAG TTG AAT TAT TTG 14 25 

Val Lys Trp Glu Pro Ala Thr Lys Asp Lys Leu Asn Tyr Leu 
465 470 475 

AAC ATT GAT GCC ACC TTA ACT TTG GGA ACA AAT CCT GAG GCA 14 67 

20 Asn lie Asp Ala Thr Leu Thr Leu Gly Thr Asn Pro Glu Ala 

480 485 

AAC CGA GTC AAA TTT TGG GAA GAC GCC ACA AAA TCT TTG CAC 150 9 

Asn Arg Val Lys Phe Trp Glu Asp Ala Thr Lys Ser Leu His 
490 495 500 

25 GGT CAA 1515 
Gly Gin 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1515 nucleotides 

30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

35 TTGACCGTGC AAAGATTTTG TGGCGTCTTC CCAAAATTTG ACTCGGTTTG 50 

CCTCAGGATT TGTTCCCAAA GTTAAGGTGG CATCAATGTT CAAATAATTC 10 0 

AACTTGTCTT TTGTGGCAGG TTCCCATTTA ACTGGCAATA ATTCTGTTAC 15 0 

TTCAGGAGTA GGATTTCCAT TTTTGATGAA GTTAGTCCAC ATGGCAAGTA 200 

CTCTATCTTT TGTTAATAGA ACATTAGGTT TATCTTTTTC CAGCTTCATC 2 50 

40 ATCTCGACTT TGAATAAATT CACAAGTTCT TCTCCATGGC ATGCACCAGT 3 00 

CAGATTATGA TCACCAAATG TTCCTTTTGC AGGATGACTT TCCGAGAAGG 3 50 

AATATTCATA ATAATAAACT GGTTCGTCTT GTTTCTCCAC AGACAACTTG 4 00 

ACATGCTTGT CAACACCTCT AGTAAACCAA ATATCTCCTA TCAAGTCAAT 45 0 

AAATTTCTGT TCATTTTCTG GAACGGGTTT ATCGTCAAAG TAAAACTTCC 500 
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TGATTTTTTC TGCAAGTTTT TTCGATTCCT CTGATCCATG GACTAATTCA 550 

AATTCGGCTG GTACGAGTCT TTCAAAATCT GCTTCAGCTT GGTTCAGCAT 60 0 

CTCTGGTTTT TCTTTCATGA AGAATTTGAA CAAAAGTCCT TCTGCACTGT 650 

TAAATCCAAC TAATAAAGGT ACTTTGTTAA AGGATCCGGA TTTCATTCTG 700 

5 GCCAATGGTG ATTCTTCCAA GAAAGGTTGG TGGGATGGGA AAACTTTTTC 75 0 

AATTGATGGT ACGAAGACAA ACTCTTCTAT TATTTCACCT TCTGTTTCAG 800 

CTGGCATTTT CTTCAGGAGA CTGTCTACGG GGG CTTTTTG TAAGAATTCT 850 

AAAGCATCTT GAGTATTGTT TGTGGGATGA CCCAATATCT CAGCAAGTTG 90 0 

AAGACTACGC TTTACTGGAT GTCTTTGGAA GGCCCATGGA TTAAAAGCAC 95 0 

10 TTCCACTTTG AGCAATTGCC CTTTTGTAAA GTCCAGTGGT TGTATGAGAT 10 00 

AACAAAAGAT AATGAACACT TGCTCCACCA GCAGAAACAC CACCAATTGT 1050 

AATATTTTCT GGATCTCCAC CAAATTTCTC AATGTTTTCT TTGGTCCATT 1100 

TCAGAGCTGC CACCTGATCC AATAATCCTA CATTTCCTGG AGCACCCTCC 1150 

AACTCCAAAT TCAGAAATCC GAGAGGTCCC AAT CGGTAAT TGAAAGTTAC 12 00 

15 GAAAATAACA TCAAAATYTA CTAAATAATC TGGGCTTTGG AATTCTAAAT 12 50 

TTCCGGATCC AGTCACAAAA CCACCACCAT GAACCCAGAA AAATACTGGA 13 00 

AGTTTTTTAT CAGAAGTTGT TTTTGGTGCG TACACGTTTA CTACCAAGCA 13 50 

GTCTTCGTCT CCTTCAATTT TCTTCAAGAA GAAATTTAAA GATTTACACA 14 0 0 

CATTTCCGTA TAATGTGGCG TTGAAAACAC CTTGCCATGG CTCAGCTTTC 14 5 0 

20 TGTGGAGGCT TAAATCTAAG TTCTCCAACA GGAGGTTTAG CATAAGGTAC 15 00 

ACCTGTGTAG CTAAA 1515 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 92 nucleotides 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

30 (A) NAME /KEY : CDS 

(B) LOCATION: 49.. 1701 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ACTGTGTGCT AATAATTCAG TACACACAGT CAATAGTCTA GATCCAAG 4 8 

ATG TCT CGT GTT ATT TTT TTA AGT TGT ATT TTT TTG TTT AGT 9 0 
35 Met Ser Arg Val lie Phe Leu Ser Cys lie Phe Leu Phe Ser 
15 10 

TTT AAT TTT ATA AAA TGT GAT CCC CCG ACT GTA ACT TTG CCC 1^2 
Phe Asn Phe lie Lys Cys Asp Pro Pro Thr Val Thr Leu Pro 
15 20 25 

40 CAG GGC GAA TTG GTT GGA AAA GCT TTG ACG AAC GAA AAT GGA 17 4 
Gin Gly Glu Leu Val Gly Lys Ala Leu Thr Asn Glu Asn Gly 
30 35 40 

AAA GAG TAT TTT AGC TAC AC A GGT GTG CCT TAT GCT AAA CCT 216 
Lys Glu Tyr Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro 
45 45 50 55 
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CCA GTT GGA GAA CTT AGA TTT AAG CCT CCA CAG AAA GCT GAG 258 
Pro Val Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu 
60 65 70 

CCA TGG AAT GGT GTT TTC AAC GCC ACA TCA CAT GGA AAT GTG 30 0 

5 Pro Trp Asn Gly Val Phe Asn Ala Thr Ser His Gly Asn Val 

75 80 

TGC AAA GCT TTG AAT TTC TTC TTG AAA AAA ATT GAA GGA GAC 34 2 

Cys Lys Ala Leu Asn Phe Phe Leu Lys Lys lie Glu Gly Asp 
85 90 95 

10 GAA GAC TGC TTG TTG GTG AAT GTG TAC GCA CCA AAA ACA ACT 3 84 

Glu Asp Cys Leu Leu Val Asn Val Tyr Ala Pro Lys Thr Thr 
100 105 HO 

TCT GAC AAA AAA CTT CCA GTA TTT TTC TGG GTT CAT GGT GGC 42 6 

Ser Asp Lys Lys Leu Pro Val Phe Phe Trp Val His Gly Gly 
15 115 120 125 

GGT TTT GTG ACT GGA TCC GGA AAT TTA GAA TTT CAA AGC CCA 4 68 

Gly Phe Val Thr Gly Ser Gly Asn Leu Glu Phe Gin Ser Pro 
13 0 13 5 14 0 

GAT TAT TTA GTA AAT TAT GAT GTT ATT TTT GTA ACT TTC AAT 510 
20 Asp Tyr Leu Val Asn Tyr Asp Val lie Phe Val Thr Phe Asn 

145 150 

TAC CGA TTG GGA CCA CTC GGA TTT TTG AAT TTG GAG TTG GAA 552 
Tyr Arg Leu Gly Pro Leu Gly Phe Leu Asn Leu Glu Leu Glu 
155 160 165 

25 GGT GCT CCT GGA AAT GTA GGA TTA TTG GAT CAG GTA GCA GCT 5 94 

Gly Ala Pro Gly Asn Val Gly Leu Leu Asp Gin Val Ala Ala 
170 175 180 

TTG AAA TGG ACC AAA GAA AAT ATT GAG AAA TTT GGT GGA GAT 63 6 

Leu Lys Trp Thr Lys Glu Asn lie Glu Lys Phe Gly Gly Asp 
30 185 190 195 

CCA GAA AAT ATT ACA ATT GGT GGT GTT TCT GCT GGT GGA GCA 67 8 

Pro Glu Asn lie Thr lie Gly Gly Val Ser Ala Gly Gly Ala 
200 205 210 

AGT GTT CAT TAT CTT TTA TTG TCA CAT ACi. ACC ACT GGA CTT 72 0 

35 Ser Val His Tyr Leu Leu Leu Ser His Thr Thr Thr Gly Leu 

215 220 

TAC AAA AGG GCA ATT GCT CAA AGT GGA AGT GCT TTA AAT CCA 7 62 

Tyr Lys Arg Ala lie Ala Gin Ser Gly Ser Ala Leu Asn Pro 
225 230 235 



40 TGG GCC TTC CAA AGA CAT CCA GTA AAG CGT AGT CTT CAA CTT 
Trp Ala Phe Gin Arg His Pro Val Lys Arg Ser Leu Gin Leu 
240 245 250 



804 
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GCT GAG ATA TTA GGT CAT CCC ACA AAC AAC ACT CAA GAT GCT 84 6 

Ala Glu lie Leu Gly His Pro Thr Asn Asn Thr Gin Asp Ala 
255 260 265 

TTA GAA TTC TTA CAA AAA GCC CCA GTA GAC AGT CTC CTG AAA 888 
5 Leu Glu Phe Leu Gin Lys Ala Pro Val Asp Ser Leu Leu Lys 
270 275 280 

AAA ATG CCA GCT GAA ACA GAA GGT GAA ATA ATA GAA GAG TTC 93 0 

Lys Met Pro Ala Glu Thr Glu Gly Glu He He Glu Glu Phe 
285 290 

10 GTC TTC GTA CCA TCA ATT GAA AAA GTT TTC CCA TCC CAC CAA 972 
Val Phe Val Pro Ser He Glu Lys Val Phe Pro Ser His Gin 
295 300 305 

CCT TTC TTG GAA GAA TCA CCA TTG GCC AGA ATG AAA TCT GGA 1014 
Pro Phe Leu Glu Glu Ser Pro Leu Ala Arg Met Lys Ser Gly 
15 310 315 320 

TCC TTT AAC AAA GTA CCT TTA TTA GTT GGA TTC AAC AGC GCA 10 56 

Ser Phe Asn Lys Val Pro Leu Leu Val Gly Phe Asn Ser Ala 
325 330 335 

GAA GGA CTT TTG TAC AAA TTC TTT ATG AAA GAA AAA CCA GAG 10 9 8 

20 Glu Gly Leu Leu Tyr Lys Phe Phe Met Lys Glu Lys Pro Glu 
340 345 350 

ATG CTG AAC CAA GCT GAA GCA GAT TTC GAA AGA CTC GTA CCA 114 0 

Met Leu Asn Gin Ala Glu Ala Asp Phe Glu Arg Leu Val Pro 
355 360 

25 GCC GAA TTT GAA TTA GCC CAT GGA TCA GAA GAA TCG AAA AAA 1182 
Ala Glu Phe Glu Leu Ala His Gly Ser Glu Glu Ser Lys Lys 
365 370 375 

CTT GCA GAA AAA ATC AGG AAG TTT TAC TTT GAC GAT AAA CCC 12 2 4 

Leu Ala Glu Lys He Arg Lys Phe Tyr Phe Asp Asp Lys Pro 
30 380 385 390 

GTT CCT GAA AAT GAG CAG AAA TTT ATT GAC TTG ATA GGA GAT 12 66 

Val Pro Glu Asn Glu Gin Lys Phe He Asp Leu He Gly Asp 
395 400 405 

ATT TGG TTT rt C: AGA GGC ATT GAC AAG CAT GTC AAG TTG TCT 13 0 8 

35 He Trp Phe Thr Arg Gly He Asp Lys His Val Lys Leu Ser 
410 415 420 

GTA GAA AAA CAA GAC GAG CCA GTA TAT TAT TAT GAA TAT TCT 13 5 0 

Val Glu Lys Gin Asp Glu Pro Val Tyr Tyr Tyr Glu Tyr Ser 
425 430 

40 TTC TCT GAA AGT CAT CCT GCA AAA GGA ACA TTT GGT GAC CAT 13 92 

Phe Ser Glu Ser His Pro Ala Lys Gly Thr Phe Gly Asp His 
435 440 445 
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AAC TTG ACT GGA GCA TGT CAT GGT GAA GAA CTT GTG AAT TTA 14 34 

Asn Leu Thr Gly Ala Cys His Gly Glu Glu Leu Val Asn Leu 
450 455 460 

TTC AAA GTC GAG ATG ATG AAG CTG GAA AAA GAT AAA CCG AAT 14 7 6 

5 Phe Lys Val Glu Met Met Lys Leu Glu Lys Asp Lys Pro Asn 
465 470 475 

GTT TTA TTA ACA AAA GAT AGG GTA CTT GCT ATG TGG ACG AAC 1518 
Val Leu Leu Thr Lys Asp Arg Val Leu Ala Met Trp Thr Asn 
480 485 490 

10 TTC ATC AAA AAT GGA AAT CCT ACT CCT GAA GTA ACT GAA TTA 1560 
Phe He Lys Asn Gly Asn Pro Thr Pro Glu Val Thr Glu Leu 
495 500 

TTG CCA GTT AAA TGG GAA CCT GCC ACA AAA GAC AAG TTG AAT 1602 
Leu Pro Val Lys Trp Glu Pro Ala Thr Lys Asp Lys Leu Asn 
15 505 510 515 

TAT TTG AAC ATT GAT GCC ACC TTA ACT TTG GGA ACA AAT CCA 1644 
Tyr Leu Asn He Asp Ala Thr Leu Thr Leu Gly Thr Asn Pro 
520 525 530 

GAA GAA ACC CGA GTC AAA TTY TGG GAA GAT GCC ACA AAA ACT 1686 
20 Glu Glu Thr Arg Val Lys Phe Trp Glu Asp Ala Thr Lys Thr 
535 540 545 

TTG CAC AGT CAA TAA AAATGTATGA AAATTGTTTT AATTATTTTA 17 31 
Leu His Ser Gin 
550 

25 GGTAATACAT TAGGTAAATA AAAATTNAAA AATAACNAAA AAAAAAAAAA 17 81 

AAAAAAAAAA A 1792 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 amino acids 

30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

("ii) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Ser Arg Val He Phe Leu Ser Cys He Phe Leu Phe Ser 
35 1 5 10 

Phe Asn Phe He Lys Cys Asp Pro Pro Thr Val Thr Leu Pro 
15 20 25 

Gin Gly Glu Leu Val Gly Lys Ala Leu Thr Asn Glu Asn Gly 
30 35 40 



40 Lys Glu Tyr Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro 
45 50 55 
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Pro Val Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu 
60 65 70 

Pro Trp Asn Gly Val Phe Asn Ala Thr Ser His Gly Asn Val 
75 80 

5 Cys Lys Ala Leu Asn Phe Phe Leu Lys Lys He Glu Gly Asp 
85 90 95 

Glu Asp Cys Leu Leu Val Asn Val Tyr Ala Pro Lys Thr Thr 
100 105 HO 

Ser Asp Lys Lys Leu Pro Val Phe Phe Trp Val His Gly Gly 
10 115 ' 120 125 

Gly Phe Val Thr Gly Ser Gly Asn Leu Glu Phe Gin Ser Pro 
130 135 140 

Asp Tyr Leu Val Asn Tyr Asp Val He Phe Val Thr Phe Asn 
145 150 

15 Tyr Arg Leu Gly Pro Leu Gly Phe Leu Asn Leu Glu Leu Glu 
155 160 165 

Gly Ala Pro Gly Asn Val Gly Leu Leu Asp Gin Val Ala Ala 
170 175 180 

Leu Lys Trp Thr Lys Glu Asn He Glu Lys Phe Gly Gly Asp 
20 185 190 195 

Pro Glu Asn He Thr He Gly Gly Val Ser Ala Gly Gly Ala 
200 205 210 

Ser Val His Tyr Leu Leu Leu Ser His Thr Thr Thr Gly Leu 
215 220 

25 Tyr Lys Arg Ala He Ala Gin Ser Gly Ser Ala Leu Asn Pro 
225 230 235 

Trp Ala Phe Gin Arg His Pro Val Lys Arg Ser Leu Gin Leu 
240 245 250 

Ala Glu He Leu Gly His Pro Thr Asn Asn Thr Gin Asp Ala 
30 255 260 265 

Leu Glu Phe Leu Gin Lys Ala Pro Val Asp Ser Leu Leu Lys 
270 275 280 

Lys Met Pro Ala Glu Thr Glu Gly Glu He He Glu Glu Phe 
285 290 

35 Val Phe Val Pro Ser He Glu Lys Val Phe Pro Ser His Gin 
295 300 305 



Pro Phe Leu Glu Glu Ser Pro Leu Ala Arg Met Lys Ser Gly 
310 315 320 
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Ser Phe Asn Lys Val Pro Leu Leu Val Gly Phe Asn Ser Ala 
325 330 335 

Glu Gly Leu Leu Tyr Lys Phe Phe Met Lys Glu Lys Pro Glu 
340 345 350 

5 Met Leu Asn Gin Ala Glu Ala Asp Phe Glu Arg Leu Val Pro 

355 360 

Ala Glu Phe Glu Leu Ala His Gly Ser Glu Glu Ser Lys Lys 
365 370 375 

Leu Ala Glu Lys lie Arg Lys Phe Tyr Phe Asp Asp Lys Pro 
10 380 385 390 

Val Pro Glu Asn Glu Gin Lys Phe lie Asp Leu lie Gly Asp 
395 400 405 

lie Trp Phe Thr Arg Gly He Asp Lys His Val Lys Leu Ser 
410 415 420 

15 Val Glu Lys Gin Asp Glu Pro Val Tyr Tyr Tyr Glu Tyr Ser 

425 430 

Phe Ser Glu Ser His Pro Ala Lys Gly Thr Phe Gly Asp His 
435 440 445 

Asn Leu Thr Gly Ala Cys His Gly Glu Glu Leu Val Asn Leu 
20 450 455 460 

Phe Lys Val Glu Met Met Lys Leu Glu Lys Asp Lys Pro Asn 
465 470 475 

Val Leu Leu Thr Lys Asp Arg Val Leu Ala Met Trp Thr Asn 
480 485 490 

25 Phe He Lys Asn Gly Asn Pro Thr Pro Glu Val Thr Glu Leu 

495 500 

Leu Pro Val Lys Trp Glu Pro Ala Thr Lys Asp Lys Leu Asn 
505 510 515 

Tyr Leu Asn He Asp Ala Thr Leu Thr Leu Gly Thr Asn Pro 
30 520 525 530 

Glu Glu Thr Arg Val Lys Phe Trp Glu Asp Ala Thr ijys Thr 
535 540 545 

Leu His Ser Gin 
550 

35 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1792 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



-122- FC-1-C1-PCT 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TTTTTTTTTT TTTTTTTTTT TTTTNGTTAT TTTTNAATTT TTATTT AC CT 5 0 

5 AATGTATTAC CTAAAATAAT TAAAACAATT TTCATACATT TTTATTGACT 100 

GTGCAAAGTT TTTGTGGCAT CTTCCCARAA TTTGACTCGG GTTTCTTCTG 15 0 

GATTTGTTCC CAAAGTTAAG GTGGCATCAA TGTTCAAATA ATTCAACTTG 2 00 

TCTTTTGTGG CAGGTTCCCA TTTAACTGGC AATAATTCAG TTACTTCAGG 25 0 

AGTAGGATTT CCATTTTTGA TGAAGTTCGT CC AC AT AG C A AGTACCCTAT 3 00 

10 CTTTTGTTAA TAAAACATTC GGTTTATCTT TTTCCAGCTT CATCATCTCG 3 50 

ACTTTGAATA AATTCACAAG TTCTTCACCA TGACATGCTC CAGTCAAGTT 4 00 

ATGGTCACCA AATGTTCCTT TTGCAGGATG ACTTTCAGAG AAAGAATATT 450 

CATAATAATA TACTGGCTCG TCTTGTTTTT CTACAGACAA CTTGACATGC 5 00 

TTGTCAATGC CTCTAGTAAA CCAAATATCT CCTATCAAGT CAATAAATTT 5 50 

15 CTGCTCATTT TCAGGAACGG GTTTATCGTC AAAGTAAAAC TTCCTGATTT 60 0 

TTTCTGCAAG TTTTTTCGAT TCTTCTGATC CATGGGCTAA TTCAAATTCG 650 

GCTGGTACGA GTCTTTCGAA ATCTGCTTCA GCTTGGTTCA GCATCTCTGG 700 

TTTTTCTTTC ATAAAGAATT TGTACAAAAG TCCTTCTGCG CTGTTGAATC 750 

CAACTAATAA AGGTACTTTG TTAAAGGATC CAGATTTCAT TCTGGCCAAT 8 00 

20 GGTGATTCTT CCAAGAAAGG TTGGTGGGAT GGGAAAACTT TTTCAATTGA 850 

TGGTACGAAG ACGAACTCTT CTATTATTTC ACCTTCTGTT TCAGCTGGCA 900 

TTTTTTTCAG GAGACTGTCT ACTGGGGCTT TTTGTAAGAA TTCTAAAGCA 95 0 

TCTTGAGTGT TGTTTGTGGG ATGACCTAAT ATCTCAGCAA GTTGAAGACT 100 0 

ACGCTTTACT GGATGTCTTT GGAAGGCCCA TGGATTTAAA GCACTTCCAC 105 0 

25 TTTGAGCAAT TGCCCTTTTG TAAAGTCCAG TGGTTGTATG TGACAATAAA 110 0 

AGATAATGAA CACTTGCTCC ACCAGCAGAA ACACCACCAA TTGTAATATT 115 0 

TTCTGGATCT CCACCAAATT TCTCAATATT TTCTTTGGTC CATTTCAAAG 12 0 0 

CTGCTACCTG ATCCAATAAT CCTACATTTC CAGGAGCACC TTCCAACTCC 12 5 0 

AAATTCAAAA ATCCGAGTGG TCCCAATCGG TAATTGAAAG TTACAAAAAT 13 0 0 

30 AACATCATAA TTTACTAAAT AATCTGGGCT TTGAAATTCT AAATTTCCGG 13 50 

ATCCAGTCAC AAAACCGCCA CCATGAACCC AGAAAAATAC TGGAAGTTTT 14 0 0 

TTGTCAGAAG TTGTTTTTGG TGCGTACACA TTCACCAACA AGCAGTCTTC 14 50 

GTCTCCTTCA ATTTTTTTCA AGAAGAAATT CAAAGCTTTG CACACATTTC 150 0 

CATGTGATGT GGCGTTGAAA ACACCATTCC ATGGCTCAGC TTTCTGTGGA 155 0 

35 GGCTTAAATC TAAGTTCTCC AACTGGAGGT TTAGCATAAG GCACACCTGT 160 0 

GTAGCTAAAA TACTCTTTTC CATTTTCGTT CGTCAAAGCT TTTCCAACCA 16 5 0 

ATTCGCCCTG GGGCAAAGTT ACAGTCGGGG GATCACATTT TATAAAATTA 17 00 

AAACTAAACA AAAAAATACA ACTTAAAAAA ATAACACGAG ACATCTTGGA 17 5 0 

TCTAGACTAT TGACTGTGTG TACTGAATTA TTAGCACACA GT 17 92 

40 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) 



SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



-123- 



ATG TCT CGT GTT ATT TTT TTA AGT TGT ATT TTT TTG TTT AGT 
Met Ser Arg Val lie Phe Leu Ser Cys lie Phe Leu Phe Ser 
1 5 10 

TTT AAT TTT ATA AAA TGT GAT CCC CCG ACT GTA ACT TTG CCC 
5 Phe Asn Phe lie Lys Cys Asp Pro Pro Thr Val Thr Leu Pro 
15 20 25 

CAG GGC GAA TTG GTT GGA AAA GCT TTG ACG AAC GAA AAT GGA 
Gin Gly Glu Leu Val Gly Lys Ala Leu Thr Asn Glu Asn Gly 
30 35 40 

10 AAA GAG TAT TTT AGC TAC ACA GGT GTG CCT TAT GCT AAA CCT 
Lys Glu Tyr Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro 
45 50 55 

CCA GTT GGA GAA CTT AGA TTT AAG CCT CCA CAG AAA GCT GAG 
Pro Val Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu 
15 60 65 70 

CCA TGG AAT GGT GTT TTC AAC GCC ACA TCA CAT GGA AAT GTG 
Pro Trp Asn Gly Val Phe Asn Ala Thr Ser His Gly Asn Val 
75 80 

TGC AAA GCT TTG AAT TTC TTC TTG AAA AAA ATT GAA GGA GAC 
20 Cys Lys Ala Leu Asn Phe Phe Leu Lys Lys lie Glu Gly Asp 
85 90 95 

GAA GAC TGC TTG TTG GTG AAT GTG TAC GCA CCA AAA ACA ACT 
Glu Asp Cys Leu Leu Val Asn Val Tyr Ala Pro Lys Thr Thr 
100 105 110 

25 TCT GAC AAA AAA CTT CCA GTA TTT TTC TGG GTT CAT GGT GGC 
Ser Asp Lys Lys Leu Pro Val Phe Phe Trp Val His Gly Gly 
115 120 125 

GGT TTT GTG ACT GGA TCC GGA AAT TTA GAA TTT CAA AGC CCA 
Gly Phe Val Thr Gly Ser Gly Asn Leu Glu Phe Gin Ser Pro 
30 130 135 140 

GAT TAT TTA GTA AAT TAT GAT GTT ATT TTT GTA ACT TTC AAT 
Asp Tyr Leu Val Asn Tyr Asp Val lie Phe Val Thr Phe Asn 
145 150 

TAC CGA TTG GGA CCA CTC GGA TTT TTG AAT TTG GAG TTG GAA 
35 Tyr Arg Leu Gly Pro Leu Gly Phe Leu Asn Leu Glu Leu Glu 
155 160 165 

GGT GCT CCT GGA AAT GTA GGA TTA TTG GAT CAG GTA GCA GCT 
Gly Ala Pro Gly Asn Val Gly Leu Leu Asp Gin Val Ala Ala 
170 175 180 

40 TTG AAA TGG ACC AAA GAA AAT ATT GAG AAA TTT GGT GGA GAT 
Leu Lys Trp Thr Lys Glu Asn lie Glu Lys Phe Gly Gly Asp 
185 190 195 



FC-1-C1-PCT 

42 



84 



126 



168 



210 



252 



294 



336 



378 



420 



462 



504 



546 



588 



-124- FC-1-C1-PCT 



CCA GAA AAT ATT ACA ATT GGT GGT GTT TCT GCT GGT GGA GCA 63 0 

Pro Glu Asn lie Thr He Gly Gly Val Ser Ala Gly Gly Ala 
200 205 210 

AGT GTT CAT TAT CTT TTA TTG TCA CAT ACA ACC ACT GGA CTT 67 2 

5 Ser Val His Tyr Leu Leu Leu Ser His Thr Thr Thr Gly Leu 

215 220 

TAC AAA AGG GCA ATT GCT CAA AGT GGA AGT GCT TTA AAT CCA 714 
Tyr Lys Arg Ala He Ala Gin Ser Gly Ser Ala Leu Asn Pro 
225 230 235 

10 TGG GCC TTC CAA AGA CAT CCA GTA AAG CGT AGT CTT CAA CTT 75 6 

Trp Ala Phe Gin Arg His Pro Val Lys Arg Ser Leu Gin Leu 
240 245 250 

GCT GAG ATA TTA GGT CAT CCC ACA AAC AAC ACT CAA GAT GCT 7 98 

Ala Glu He Leu Gly His Pro Thr Asn Asn Thr Gin Asp Ala 
15 255 260 265 

TTA GAA TTC TTA CAA AAA GCC CCA GTA GAC AGT CTC CTG AAA 84 0 

Leu Glu Phe Leu Gin Lys Ala Pro Val Asp Ser Leu Leu Lys 
270 275 280 

AAA ATG CCA GCT GAA ACA GAA GGT GAA ATA ATA GAA GAG TTC 8 82 

20 Lys Met Pro Ala Glu Thr Glu Gly Glu He He Glu Glu Phe 

285 290 

GTC TTC GTA CCA TCA ATT GAA AAA GTT TTC CCA TCC CAC CAA 924 
Val Phe Val Pro Ser He Glu Lys Val Phe Pro Ser His Gin 
295 300 305 

25 CCT TTC TTG GAA GAA TCA CCA TTG GCC AGA ATG AAA TCT GGA 966 
Pro Phe Leu Glu Glu Ser Pro Leu Ala Arg Met Lys Ser Gly 
310 315 320 

TCC TTT AAC AAA GTA CCT TTA TTA GTT GGA TTC AAC AGC GCA 1008 
Ser Phe Asn Lys Val Pro Leu Leu Val Gly Phe Asn Ser Ala 
30 325 330 335 

GAA GGA CTT TTG TAC AAA TTC TTT ATG AAA GAA AAA CCA GAG 105 0 

Glu Gly Leu Leu Tyr Lys Phe Phe Met Lys Glu Lys Pro Glu 
340 345 350 

ATG CTG AAC CAA GCT GAA GCA GAT TTC GAA AGA CTC GVA CCA 10 9^. 

35 Met Leu Asn Gin Ala Glu Ala Asp Phe Glu Arg Leu Val Pro 

355 360 

GCC GAA TTT GAA TTA GCC CAT GGA TCA GAA GAA TCG AAA AAA 113 4 

Ala Glu Phe Glu Leu Ala His Gly Ser Glu Glu Ser Lys Lys 
365 370 375 



40 CTT GCA GAA AAA ATC AGG AAG TTT TAC TTT GAC GAT AAA CCC 
Leu Ala Glu Lys He Arg Lys Phe Tyr Phe Asp Asp Lys Pro 
380 385 390 



1176 



-125- 



FC-1-C1-PCT 



GTT CCT GAA AAT GAG CAG AAA TTT ATT GAC TTG ATA GGA GAT 1218 
Val Pro Glu Asn Glu Gin Lys Phe lie Asp Leu lie Gly Asp 
395 400 405 

ATT TGG TTT ACT AG A GGC ATT GAC AAG CAT GTC AAG TTG TCT 1260 
5 lie Trp Phe Thr Arg Gly lie Asp Lys His Val Lys Leu Ser 
410 415 420 

GTA GAA AAA CAA GAC GAG CCA GTA TAT TAT TAT GAA TAT TCT 13 02 

Val Glu Lys Gin Asp Glu Pro Val Tyr Tyr Tyr Glu Tyr Ser 
425 430 . 

10 TTC TCT GAA AGT CAT CCT GCA AAA GGA ACA TTT GGT GAC CAT 1344 
Phe Ser Glu Ser His Pro Ala Lys Gly Thr Phe Gly Asp His 
435 440 445 

AAC TTG ACT GGA GCA TGT CAT GGT GAA GAA CTT GTG AAT TTA 13 86 

Asn Leu Thr Gly Ala Cys His Gly Glu Glu Leu Val Asn Leu 
15 450 455 460 

TTC AAA GTC GAG ATG ATG AAG CTG GAA AAA GAT AAA CCG AAT 142 8 

Phe Lys Val Glu Met Met Lys Leu Glu Lys Asp Lys Pro Asn 
465 470 475 

GTT TTA TTA ACA AAA GAT AGG GTA CTT GCT ATG TGG ACG AAC 147 0 

20 Val Leu Leu Thr Lys Asp Arg Val Leu Ala Met Trp Thr Asn 
480 485 490 

TTC ATC AAA AAT GGA AAT CCT ACT CCT GAA GTA ACT GAA TTA 1512 
Phe lie Lys Asn Gly Asn Pro Thr Pro Glu Val Thr Glu Leu 
495 500 

25 TTG CCA GTT AAA TGG GAA CCT GCC ACA AAA GAC AAG TTG AAT 1554 
Leu Pro Val Lys Trp Glu Pro Ala Thr Lys Asp Lys Leu Asn 
505 510 515 

TAT TTG AAC ATT GAT GCC ACC TTA ACT TTG GGA ACA AAT CCA 15 96 

Tyr Leu Asn lie Asp Ala Thr Leu Thr Leu Gly Thr Asn Pro 
30 520 525 530 

GAA GAA ACC CGA GTC AAA TTY TGG GAA GAT GCC ACA AAA ACT 163 8 

Glu Glu Thr Arg Val Lys Phe Trp Glu Asp Ala Thr Lys Thr 
535 540 545 

TTG CAC AGT CAA 1650 
35 Leu His Ser Gin 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 nucleotides 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



-126- 



FC-1-C1-PCT 



(iii) SEQUENCE DESCRIPTION: SEQ ID NO:22: 



TTGACTGTGC AAAGTTTTTG TGGCATCTTC CCARAATTTG ACTCGGGTTT 50 

CTTCTGGATT TGTTCCCAAA GTTAAGGTGG CATCAATGTT CAAATAATTC 100 

AACTTGTCTT TTGTGGCAGG TTCCCATTTA ACTGGCAATA ATTCAGTTAC 150 

5 TTCAGGAGTA GGATTTCCAT TTTTGATGAA GTTCGTCCAC ATAGCAAGTA 2 00 

CCCTATCTTT TGTTAATAAA ACATTCGGTT TATCTTTTTC CAGCTTCATC 2 50 

ATCTCGACTT TGAATAAATT CACAAGTTCT TCACCATGAC ATGCTCCAGT 3 00 

CAAGTTATGG TCACCAAATG TTCCTTTTGC AGGATGACTT TCAGAGAAAG 3 50 

AATATTCATA ATAATATACT GGCTCGTCTT GTTTTTCTAC AGACAACTTG 4 00 

10 ACATGCTTGT CAATGCCTCT AGTAAACCAA ATATCTCCTA TCAAGTCAAT 4 50 

AAATTTCTGC TCATTTTCAG GAACGGGTTT ATCGTCAAAG TAAAACTTCC 500 

TGATTTTTTC TGCAAGTTTT TTCGATTCTT CTGATCCATG GGCTAATTCA 5 50 

AATTCGGCTG GTACGAGTCT TTCGAAATCT GCTTCAGCTT GGTTCAGCAT 600 

CTCTGGTTTT TCTTTCATAA AGAATTTGTA CAAAAGTCCT TCTGCGCTGT 650 

15 TGAATCCAAC TAATAAAGGT ACTTTGTTAA AGGATCCAGA TTTCATTCTG 7 00 

GCCAATGGTG ATTCTTCCAA GAAAGGTTGG TGGGATGGGA AAACTTTTTC 750 

AATTGATGGT ACGAAGACGA ACTCTTCTAT TATTTCACCT TCTGTTTCAG 800 

CTGGCATTTT TTTCAGGAGA CTGTCTACTG GGGCTTTTTG TAAGAATTCT 850 

AAAGCATCTT GAGTGTTGTT TGTGGGATGA CCTAATATCT CAGCAAGTTG 900 

20 AAGACTACGC TTTACTGGAT GTCTTTGGAA GGCCCATGGA TTTAAAGCAC 95 0 

TTCCACTTTG AGCAATTGCC CTTTTGTAAA GTCCAGTGGT TGTATGTGAC 100 0 

AATAAAAGAT AATGAACACT TGCTCCACCA GCAGAAACAC CACCAATTGT 105 0 

AATATTTTCT GGATCTCCAC CAAATTTCTC AATATTTTCT TTGGTCCATT 110 0 

TCAAAGCTGC TACCTGATCC AATAATCCTA CATTTCCAGG AGCACCTTCC 1150 

25 AACTCCAAAT TCAAAAATCC GAGTGGTCCC AATCGGTAAT TGAAAGTTAC 12 0 0 

AAAAATAACA TCATAATTTA CTAAATAATC TGGGCTTTGA AATTCTAAAT 1250 

TTCCGGATCC AGTCACAAAA CCGCCACCAT GAACCCAGAA AAATACTGGA 13 0 0 

AGTTTTTTGT CAGAAGTTGT TTTTGGTGCG TACACATTCA CCAACAAGCA 135 0 

GTCTTCGTCT CCTTCAATTT TTTTCAAGAA GAAATTCAAA GCTTTGCACA 14 0 0 

30 CATTTCCATG TGATGTGGCG TTGAAAACAC CATTCCATGG CTCAGCTTTC 14 5 0 

TGTGGAGGCT TAAATCTAAG TTCTCCAACT GGAGGTTTAG CATAAGGCAC 15 0 0 

ACCTGTGTAG CTAAAATACT CTTTTCCATT TTCGTTCGTC AAAGCTTTTC 1550 

CAACCAATTC GCCCTGGGGC AAAGTTACAG TCGGGGGATC ACATTTTATA 160 0 

AAATTAAAAC TAAACAAAAA AATACAACTT AAAAAAATAA CACGAGACAT 165 0 



35 (2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1590 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1590 



45 <iv) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

GAT CCC CCG ACT GTA ACT TTG CCC CAG GGC GAA TTG GTT GGA 4 2 

Asp Pro Pro Thr Val Thr Leu Pro Gin Gly Glu Leu Val Gly 
15 10 



-127- 



FC-1-C1-PCT 



AAA GCT TTG ACG AAC GAA AAT GGA AAA GAG TAT TTT AGC TAC 84 
Lys Ala Leu Thr Asn Glu Asn Gly Lys Glu Tyr Phe Ser Tyr 
15 20 25 

ACA GGT GTG CCT TAT GCT AAA CCT CCA GTT GGA GAA CTT AGA 12 6 

5 Thr Gly Val Pro Tyr Ala Lys Pro Pro Val Gly Glu Leu Arg 
30 35 40 

TTT AAG CCT CCA CAG AAA GCT GAG CCA TGG AAT GGT GTT TTC 168 
Phe Lys Pro Pro Gin Lys Ala Glu Pro Trp Asn Gly Val Phe 
45 50 55 

10 AAC GCC ACA TCA CAT GGA AAT GTG TGC AAA GCT TTG AAT TTC 210 
Asn Ala Thr Ser His Gly Asn Val Cys Lys Ala Leu Asn Phe 
60 65 70 

TTC TTG AAA AAA ATT GAA GGA GAC GAA GAC TGC TTG TTG GTG 252 
Phe Leu Lys Lys lie Glu Gly Asp Glu Asp Cys Leu Leu Val 
15 75 80 

AAT GTG TAC GCA CCA AAA ACA ACT TCT GAC AAA AAA CTT CCA 2 94 

Asn Val Tyr Ala Pro Lys Thr Thr Ser Asp Lys Lys Leu Pro 
85 90 95 

GTA TTT TTC TGG GTT CAT GGT GGC GGT TTT GTG ACT GGA TCC 336 
20 Val Phe Phe Trp Val His Gly Gly Gly Phe Val Thr Gly Ser 
100 105 110 

GGA AAT TTA GAA TTT CAA AGC CCA GAT TAT TTA GTA AAT TAT 378 
Gly Asn Leu Glu Phe Gin Ser Pro Asp Tyr Leu Val Asn Tyr 
115 120 125 

25 GAT GTT ATT TTT GTA ACT TTC AAT TAC CGA TTG GGA CCA CTC 42 0 

Asp Val lie Phe Val Thr Phe Asn Tyr Arg Leu Gly Pro Leu 
130 135 140 

GGA TTT TTG AAT TTG GAG TTG GAA GGT GCT CCT GGA AAT GTA 4 62 

Gly Phe Leu Asn Leu Glu Leu Glu Gly Ala Pro Gly Asn Val 
30 145 15 0 

GGA TTA TTG GAT CAG GTA GCA GCT TTG AAA TGG ACC AAA GAA 5 04 

Gly Leu Leu Asp Gin Val Ala Ala Leu Lys Trp Thr Lys Glu 
155 160 165 

AAT ATT GAG AAA TTT GGT GGA GAT CCA GAA AAT ATT ACA ATT 54 6 

35 Asn lie Glu Lys Phe Gly Gly Asp Pro Glu Asn lie Thr lie 
170 175 180 

GGT GGT GTT TCT GCT GGT GGA GCA AGT GTT CAT TAT CTT TTA 58 8 

Gly Gly Val Ser Ala Gly Gly Ala Ser Val His Tyr Leu Leu 
185 190 195 



40 TTG TCA CAT ACA ACC ACT GGA CTT TAC AAA AGG GCA ATT GCT 
Leu Ser His Thr Thr Thr Gly Leu Tyr Lys Arg Ala lie Ala 
200 205 210 



630 



-128- 



CAA AGT 
Gin Ser 



CCA GTA 
5 Pro Val 

225 

CCC ACA 
Pro Thr 
240 

10 GCC CCA 
Ala Pro 



GAA GGT 
Glu Gly 

15 

GAA AAA 
Glu Lys 



CCA TTG 
20 Pro Leu 
295 

TTA TTA 
Leu Leu 
310 

25 TTC TTT 
Phe Phe 



GCA GAT 
Ala Asp 

30 

CAT GGA 
His Gly 



AAG TTT 
35 Lys Phe 
365 

AAA TTT 
Lys Phe 
380 

40 ATT GAC 
lie Asp 



GGA AGT GCT 
Gly Ser Ala 
215 

AAG CGT AGT 
Lys Arg Ser 



AAC AAC ACT 
Asn Asn Thr 



GTA GAC AGT 
Val Asp Ser 
255 

GAA ATA ATA 
Glu lie lie 
270 

GTT TTC CCA 
Val Phe Pro 
285 

GCC AGA ATG 
Ala Arg Met 



GTT GGA TTC 
Val Gly Phe 



ATG AAA GAA 
Met Lys Glu 
325 

TTC GAA AGA 
Phe Glu Arg 
340 

TCA GAA GAA 
Ser Glu Glu 
355 

TAC TTT GAC 
Tyr Phe Asp 

ATT GAC TTG 
lie Asp Leu 



AAG CAT GTC 
Lys His Val 
395 



TTA AAT CCA 
Leu Asn Pro 



CTT CAA CTT 
Leu Gin Leu 
230 

CAA GAT GCT 
Gin Asp Ala 
245 

CTC CTG AAA 
Leu Leu Lys 
260 

GAA GAG TTC 
Glu Glu Phe 



TCC CAC CAA 
Ser His Gin 



AAA TCT GGA 
Lys Ser Gly 
300 

AAC AGC GCA 
Asn Ser Ala 
315 

AAA CCA GAG 
Lys Pro Glu 
330 

CTC GTA CCA 
Leu Val Pro 



TCG AAA AAA 
Ser Lys Lys 

GAT AAA CCC 
Asp Lys Pro 
370 

ATA GGA GAT 
lie Gly Asp 
385 

AAG TTG TCT 
Lys Leu Ser 
400 



TGG GCC TTC 
Trp Ala Phe 
220 

GCT GAG ATA 
Ala Glu He 
235 

TTA GAA TTC 
Leu Glu Phe 



AAA ATG CCA 
Lys Met Pro 



GTC TTC GTA 
Val Phe Val 
275 

CCT TTC TTG 
Pro Phe Leu 
290 

TCC TTT AAC 
Ser Phe Asn 
305 

GAA GGA CTT 
Glu Gly Leu 



ATG CTG AAC 
Met Leu Asn 



GCC GAA TTT 
Ala Glu Phe 
345 

CTT GCA GAA 
Leu Ala Glu 
360 

GTT CCT GAA 
Val Pro Glu 
375 

ATT TGG TTT 
He Trp Phe 



GTA GAA AAA 
Val Glu Lys 



CAA AGA CAT 
Gin Arg His 



TTA GGT CAT 
Leu Gly His 

TTA CAA AAA 
Leu Gin Lys 
250 

GCT GAA ACA 
Ala Glu Thr 
265 

CCA TCA ATT 
Pro Ser He 
280 

GAA GAA TCA 
Glu Glu Ser 



AAA GTA CCT 
Lys Val Pro 



TTG TAC AAA 
Leu Tyr Lys 
320 

CAA GCT GAA 
Gin Ala Glu 
335 

GAA TTA GCC 
Glu Leu Ala 
350 

AAA ATC AGG 
Lys He Arg 



AAT GAG CAG 
Asn Glu Gin 



ACT AGA GGC 
Thr Arg Gly 
390 

CAA GAC GAG 
Gin Asp Glu 
405 



FC-1-C1-PCT 

672 



714 



756 



798 



840 



882 



924 



966 



1008 



1050 



1092 



1134 



1176 



1218 



-129- 



FC-1-C1-PCT 



CCA GTA TAT TAT TAT GAA TAT TCT TTC TCT GAA AGT CAT CCT 12 60 

Pro Val Tyr Tyr Tyr Glu Tyr Ser Phe Ser Glu Ser His Pro 
410 415 420 

GCA AAA GGA ACA TTT GGT GAC CAT AAC TTG ACT GGA GCA TGT 13 02 

5 Ala Lys Gly Thr Phe Gly Asp His Asn Leu Thr Gly Ala Cys 

425 430 

CAT GGT GAA GAA CTT GTG AAT TTA TTC AAA GTC GAG ATG ATG 1344 
His Gly Glu Glu Leu Val Asn Leu Phe Lys Val Glu Met Met 
435 440 445 

10 AAG CTG GAA AAA GAT AAA CCG AAT GTT TTA TTA ACA AAA GAT 13 8 6 

Lys Leu Glu Lys Asp Lys Pro Asn Val Leu Leu Thr Lys Asp 
450 455 460 



AGG GTA CTT GCT ATG TGG ACG AAC TTC ATC AAA AAT GGA AAT 142 8 

Arg Val Leu Ala Met Trp Thr Asn Phe lie Lys Asn Gly Asn 
15 465 470 475 

CCT ACT CCT GAA GTA ACT GAA TTA TTG CCA GTT AAA TGG GAA 14 7 0 

Pro Thr Pro Glu Val Thr Glu Leu Leu Pro Val Lys Trp Glu 
480 485 490 



CCT GCC ACA AAA GAC AAG TTG AAT TAT TTG AAC ATT GAT GCC 1512 
20 Pro Ala Thr Lys Asp Lys Leu Asn Tyr Leu Asn lie Asp Ala 

495 500 



ACC TTA ACT TTG GGA ACA AAT CCA GAA GAA ACC CGA GTC AAA 15 54 

Thr Leu Thr Leu Gly Thr Asn Pro Glu Glu Thr Arg Val Lys 
505 510 515 



25 TTY TGG GAA GAT GCC ACA AAA ACT TTG CAC AGT CAA 15 90 

Phe Trp Glu Asp Ala Thr Lys Thr Leu His Ser Gin 
520 525 530 



(2) INFORMATION FOR SEQ ID NO: 24: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2836 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TCPOLOGY: linear 



35 (ii) MOLECULE TYPE: cDNa 

(iii) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 99. .1889 



(iv) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

40 TAGACATGTC GTCTTCAAAA CGTCTATTTT ATCATAAACA AAACGAGATA 5 0 

AATAATAACA ATTAAGCAAC CAAAATGCAT TAAAAAACAC AATAAAAA 98 



•130- FC-1-C1-PCT 



ATG TTA CCT CAC AGT AGT GCA TTA GTT TTA TTT TTA TTT TTT 14 0 

Met Leu Pro His Ser Ser Ala Leu Val Leu Phe Leu Phe Phe 
1 5 10 

TTA TTT TTC TTA TTT ACA CCT ATC TTG TGC ATA CTA TGG GAT 182 
5 Leu Phe Phe Leu Phe Thr Pro lie Leu Cys lie Leu Trp Asp 
15 20 25 

AAC CTA GAT CAG CAT TTG TGC AGA GTT CAA TTT AAC GGG ATC 2 24 

Asn Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn Gly lie 
30 35 40 

10 ACG GAA GGA AAA CCG TTC CGA TAT AAA GAt CAT AGG AAT GAT 2 66 

Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Arg Asn Asp 
45 50 55 

GTA TAT TGT TCT TAT TTG GGA ATT CCT TAT GCC GAA CCG CCT 3 08 

Val Tyr Cys Ser Tyr Leu Gly lie Pro Tyr Ala Glu Pro Pro 
15 60 65 70 

TTT GGA CCA TTA CGA TTT CAG TCT CCA AAA CCA ATA TCA AAT 3 50 

Phe Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro lie Ser Asn 
75 80 

CCA AAA ACA GGA TTC GTA CAG GCT CGA ACT TTG GGA GAC AAA 3 92 

20 Pro Lys Thr Gly Phe Val Gin Ala Arg Thr Leu Gly Asp Lys 
85 90 95 

TGT TTC CAG GAA AGT CTA ATA TAT TCT TAT GCA GGA AGC GAA 4 34 

Cys Phe Gin Glu Ser Leu lie Tyr Ser Tyr Ala Gly Ser Glu 
100 105 110 

25 GAT TGC TTA TAT CTG AAT ATA TTC ACG CCA GAG ACT GTT AAT 4 76 

Asp Cys Leu Tyr Leu Asn lie Phe Thr Pro Glu Thr Val Asn 
115 120 125 

TCT GCG AAC AAT ACA AAA TAT CCT GTA ATG TTC TGG ATC CAT 518 
Ser Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp lie His 
30 130 135 140 

GGA GGC GCA TTC AAC CAA GGA TCA GGA TCT TAT AAT TTT TTT 5 60 

Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn Phe Phe 
145 150 

GGA CCT G'vT TAT TTG ATC AGG GAA GGA ATT ATT TTG GTC ACT 602 

35 Gly Pro Asp Tyr Leu lie Arg Glu Gly lie He Leu Val Thr 
155 160 165 

ATC AAC TAT AGA TTA GGA GTT TTC GGT TTT CTA TCA GCG CCG 644 

He Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser Ala Pro 
170 175 180 

40 GAA TGG GAT ATC CAT GGA AAT ATG GGT CTA AAA GAC CAG AGA 686 
Glu Trp Asp He His Gly Asn Met Gly Leu Lys Asp Gin Arg 
185 190 195 
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TTG GCA CTA AAA TGG GTT TAC GAC AAC ATC GAA AAG TTT GGT 72 8 

Leu Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys Phe Gly 
200 205 210 

GGA GAC AGA GAA AAA ATT ACA ATT GCT GGA GAA TCT GCT GGA 77 0 

5 Gly Asp Arg Glu Lys He Thr He Ala Gly Glu Ser Ala Gly 

215 220 

GCA GCA AGT GTC CAT TTT CTG ATG ATG GAC AAC TCG ACT AGA 812 
Ala Ala Ser Val His Phe Leu Met Met Asp Asn Ser Thr Arg 
225 230 235 

10 AAA TAC TAC CAA AGG GCC ATT TTG CAG AGT GGG ACA TTA CTA 854 
Lys Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr Leu Leu 
240 245 250 

AAT CCG ACT GCT AAT CAA ATT CAA CTT CTG CAT AGA TTT GAA 896 
Asn Pro Thr Ala Asn Gin He Gin Leu Leu His Arg Phe Glu 
15 255 260 265 

AAA CTC AAA CAA GTG CTA AAC ATC ACG CAA AAA CAA GAA CTC 93 8 

Lys Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin Glu Leu 
270 275 2 80 

CTA AAC CTG GAT AAA AAC CTA ATT TTA CGA GCA GCC TTA AAC 98 0 

20 Leu Asn Leu Asp Lys Asn Leu He Leu Arg Ala Ala Leu Asn 

285 290 

AGA GTT CCT GAT AGC AAC GAC CAT GAC CGA GAC ACA GTA CCA 102 2 

Arg Val Pro Asp Ser Asn Asp His Asp Arg Asp Thr Val Pro 
295 300 305 

25 GTA TTT AAT CCA GTC TTA GAA TCA CCA GAA TCT CCA GAT CCA 1064 
Val Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro Asp Pro. 
310 315 320 

ATA ACA TTT CCA TCT GCC TTG GAA AGA ATG AGA AAT GGT GAA 1106 
He Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn Gly Glu 
30 325 330 335 

TTT CCT GAT GTC GAT GTC ATC ATT GGT TTC AAT AGT GCT GAA 114 8 

Phe Pro Asp Val Asp Val He He Gly Phe Asn Ser Ala Glu 
340 345 350 

GGT TTA AGA TCT ATG GCA AGA GTA ACC AGA GGA AAC ATG GAA H90 
35 Gly Leu Arg Ser Met Ala Arg Val Thr Arg Gly Asn Met Glu 

355 360 

GTT CAC AAG ACT TTG ACA AAT ATA GAA AGG GCT ATA CCT AGA 12 3 2 

Val His Lys Thr Leu Thr Asn He Glu Arg Ala He Pro Arg 
365 370 375 
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GAT GCT AAT ATT TGG AAA AAT CCA AAT GGT ATT GAG GAG AAA 12 74 

Asp Ala Asn lie Trp Lys Asn Pro Asn Gly lie Glu Glu Lys 
380 385 390 

AAA CTA ATA AAA ATG CTT AC A GAG TTT TAT GAC CAA GTG AAA 1316 
5 Lys Leu lie Lys Met Leu Thr Glu Phe Tyr Asp Gin Val Lys 
395 400 405 

GAA CAA AAC GAT GAC ATT GAA GCC TAC GTC CAA CTA AAA GGC 13 58 

Glu Gin Asn Asp Asp lie Glu Ala Tyr Val Gin Leu Lys Gly 
410 415 420 

10 GAT GCT GGT TAC CTC CAA GGA ATC TAC CGT ACC TTG AAA GCC 14 0 0 

Asp Ala Gly Tyr Leu Gin Gly lie Tyr Arg Thr Leu Lys Ala 
425 430 

ATA TTT TTC AAT GAA TTC AG A AGG AAT TCC AAT TTG TAT TTG 144 2 

lie Phe Phe Asn Glu Phe Arg Arg Asn Ser Asn Leu Tyr Leu 
15 435 440 445 

TAC AGG TTA TCA GAC GAT ACG TAT AGT GTA TAT AAA AGT TAT 14 84 

Tyr Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys Ser Tyr 
450 455 460 

ATC TTG CCC TAT CGA TGG GGT TCC TTG CCA GGA GTT AGT CAT 152 6 

20 lie Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val Ser His 
465 470 475 

GGT GAT GAT TTA GGA TAT CTT TTT GCA AAC TCG TTG GAT GTT 1568 
Gly Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu Asp Val 
480 485 490 

25 CCT ATT TTG GGA ACA ACG CAC ATT TCT ATA CCG CAA GAT GCT 1610 
Pro lie Leu Gly Thr Thr His lie Ser lie Pro Gin Asp Ala 
495 500 

ATG CAG ACT CTG GAA AGG ATG GTC AGG ATC TGG ACC AAT TTT 1652 
Met Gin Thr Leu Glu Arg Met Val Arg lie Trp Thr Asn Phe 
30 505 510 515 

GTA AAG AAT GGA AAA CCT ACA TCA AAC ACT GAA GAT GCA TCA 16 94 

Val Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp Ala Ser 
520 525 530 

TGT GAT ACA AAA AGA CAT TTA AAC GAC ATT TTT TGG GAA CCA 173 6 

35 Cys Asp Thr Lys Arg His Leu Asn Asp lie Phe Trp C\v Pro 
535 540 545 

TAC AAC GAC GAA GAA CCA AAA TAT TTG GAC ATG GGA AAA GAA 177 8 

Tyr Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly Lys Glu 
550 555 560 

40 AAT TTT GAA ATG AAA AAT ATT TTG GAA CTA AAA CGC ATG ATG 18 2 0 

Asn Phe Glu Met Lys Asn lie Leu Glu Leu Lys Arg Met Met 
565 570 
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CTT TGG GAT GAA GTT TAT AGA AAT GCG AAT TTG CGG TTT AGA 1862 
Leu Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg Phe Arg 
575 580 585 

GTC TGT AAT GAA GAA AGT ATT AGA TGA GTTTTTTTAA 1899 
5 Val Cys Asn Glu Glu Ser lie Arg 
590 595 

TTTTACATAC AGCCGAGAGG AAACATGACT AAAATTGGAA AGAAAAATCA 1949 

GAAAAAGAAA AATCACATGG ACCATAGTAA CTTTATTACA TGATTTAGTT 1999 

TCAAGTGTAT CAAGAAAACT TATTGCATCA AAGAAAATAT TATTTTGCCA 2 04 9 

10 AAATTCTTGG AAAAACACTT TTTATGACTG ACATGGCCCA TAATTGAAGC 2099 

TTTTTCTTCT TTTACCAAAT CGCCAAATTT TGTAGCGTCA GACACATTTA 214 9 

TTTATGACAT GGCAATTAAT GTGTTAAACA TTCAACTCTA TATTAAAAAT 2199 

GGTAGTATTT TCTAATAAGA AGGTTATATA AAAAGACTTG AAAATAATAA 2249 

GATTTGCTCG G CT AT AT AT A AAAACTTANC GTCTCGTTAT GCTAAACTTT 22 99 

15 TTTGATGGTA AAAATATGTT GATTTTCCTA ATAATCTAAG ATATTATATT 2 34 9 

TTAGATTAAA TTAAAATATG ATATTTTCAA TTAATTAATT TTAGTTTTAA 2 3 99 

ATGTACTATA TTTACCAGTA CTATGAAACT ATTTTAAATA TATTTTTTAT 244 9 

TACAATATTT ATTTCTCAAA AATGTTTAGT GTAACAAGAC CATTAAATTA 24 99 

GAGTTAATGT TGTAAATTAA ACTATTTTTT ATCTATCACA AC CG CTT AAT 2 54 9 

20 TGGTGCAAAG AAAAATTTTA CTGTGATAAT ATTTGACATT TACACAATAT 2 5 99 

TACGAATTGT AAACTCACAA TTATGTGAAT ATTGTTTTTT GTTAAAAAAA 2 64 9 

CATACATGAC TTTTCTATAT CATTTTATAT TACGGTGATA TGGATTAATG 2 6 99 

TCAACATGTA AAATACAAAT GCGGTTGTTA AAAATAATCT GTATTAAAAT 2 74 9 

TGTTATATAA AATCTGAATA AATGTACTTT TAAGTAAAAA AAAAAAAAAA 2 7 99 

25 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA 2836 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Leu Pro His Ser Ser Ala Leu Val Leu Phe Leu Phe Phe 
15 10 

35 Leu Phe Phe Leu Phe Thr Pro He Leu Cys He Leu Trp Asp 
15 20 25 

Asn Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn Gly He 
30 35 40 

Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Arg Asn Asp 
40 45 50 55 



Val Tyr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu Pro Pro 
60 65 70 
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Phe Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro lie Ser Asn 
75 80 

Pro Lys Thr Gly Phe Val Gin Ala Arg Thr Leu Gly Asp Lys 
85 90 95 

5 Cys Phe Gin Glu Ser Leu lie Tyr Ser Tyr Ala Gly Ser Glu 
100 105 . HO 

Asp Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr Val Asn 
115 120 125 

Ser Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp He His 
10 130 135 140 

Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn Phe Phe 
145 150 

Gly Pro Asp Tyr Leu He Arg Glu Gly He He Leu Val Thr 
155 160 165 

15 He Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser Ala Pro 
170 175 180 

Glu Trp Asp He His Gly Asn Met Gly Leu Lys Asp Gin Arg 
185 190 195 

Leu Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys Phe Gly 
20 200 205 210 

Gly Asp Arg Glu Lys He Thr He Ala Gly Glu Ser Ala Gly 
215 220 

Ala Ala Ser Val His Phe Leu Met Met Asp Asn Ser Thr Arg 
225 230 235 

25 Lys Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr Leu Leu 
240 245 250 

Asn Pro Thr Ala Asn Gin He Gin Leu Leu His Arg Phe Glu 
255 260 265 

Lys Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin Glu Leu 
30 270 275 280 

jfJ Asn Leu Asp Lys Asn Leu He Leu Arg Ala Ala Leu Asn 
285 290 

Arg Val Pro Asp Ser Asn Asp His Asp Arg Asp Thr Val Pro 
295 300 305 

35 Val Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro Asp Pro 
310 315 320 
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He Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn Gly Glu 
325 330 335 

Phe Pro Asp Val Asp Val He He Gly Phe Asn Ser Ala Glu 
340 345 350 

5 Gly Leu Arg Ser Met Ala Arg Val Thr Arg Gly Asn Met Glu 

355 360 

Val His Lys Thr Leu Thr Asn He Glu Arg Ala He Pro Arg 
365 370 375 

Asp Ala Asn He Trp Lys Asn Pro Asn Gly He Glu Glu Lys 
10 380 385 390 

Lys Leu He Lys Met Leu Thr Glu Phe Tyr Asp Gin Val Lys 
395 400 405 

Glu Gin Asn Asp Asp He Glu Ala Tyr Val Gin Leu Lys Gly 
410 415 420 

15 Asp Ala Gly Tyr Leu Gin Gly He Tyr Arg Thr Leu Lys Ala 

425 430 

He Phe Phe Asn Glu Phe Arg Arg Asn Ser Asn Leu Tyr Leu 
435 440 445 

Tyr Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys Ser Tyr 
20 450 455 460 

He Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val Ser His 
465 470 475 

Gly Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu Asp Val 
480 485 490 

25 Pro He Leu Gly Thr Thr His He Ser He Pro Gin Asp Ala 

495 500 

Met Gin Thr Leu Glu Arg Met Val Arg He Trp Thr Asn Phe 
505 510 515 

Val Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp Ala Ser 
30 520 525 530 

Cys Asp Thr Lys Arg His Leu Asn Asp He Phe Trp Glu Pro 
535 540 545 

Tyr Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly Lys Glu 
550 555 560 



35 Asn Phe Glu Met Lys Asn He Leu Glu Leu Lys Arg Met Met 

565 570 
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Leu Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg Phe Arg 
575 580 585 

Val Cys Asn Glu Glu Ser lie Arg 
590 595 

5 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2836 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TTTTTTTTTT TTTTTTTTTT ttttttxttt ttttttxttx tttttttttt 50 

TTACTTAAAA GTACATTTAT TCAGATTTTA TATAACAATT TTAATACAGA 10 0 

15 TTATTTTTAA CAACCGCATT TGTATTTTAC ATGTTGACAT TAATCCATAT 150 

CACCGTAATA TAAAATGATA TAGAAAAGTC ATGTATGTTT TTTTAACAAA 2 00 

AAACAATATT CACATAATTG TGAGTTTACA ATTCGTAATA TTGTGTAAAT 2 50 

GTCAAATATT ATCACAGTAA AATTTTTCTT TGCACCAATT AAGCGGTTGT 3 00 

GATAGATAAA AAATAGTTTA ATTTACAACA TTAACTCTAA TTTAATGGTC 3 50 

20 TTGTTACACT AAACATTTTT GAGAAATAAA TATTGTAATA AAAAATATAT 4 00 

TTAAAATAGT TTCATAGTAC TGGTAAATAT AGTACATTTA AAACTAAAAT 450 

TAATTAATTG AAAATATCAT ATTTTAATTT AATCTAAAAT ATAATATCTT 500 

AGATTATTAG GAAAATCAAC ATATTTTTAC CATCAAAAAA GTTTAGCATA 5 50 

ACGAGACGNT AAGTTTTTAT ATATAGCCGA GCAAATCTTA TTATTTTCAA 6 00 

25 GTCTTTTTAT ATAACCTTCT TATTAGAAAA TACTACCATT TTTAATATAG 650 

AGTTGAATGT TTAACACATT AATTGCCATG TCATAAATAA ATGTGTCTGA 7 00 

CGCTACAAAA TTTGGCGATT TGGTAAAAGA AGAAAAAGCT TCAATTATGG 75 0 

GCCATGTCAG TCATAAAAAG TGTTTTTCCA AGAATTTTGG CAAAATAATA 8 00 

TTTTCTTTGA TGCAATAAGT TTTCTTGATA CACTTGAAAC TAAATCATGT 85 0 

30 AATAAAGTTA CTATGGTCCA TGTGATTTTT CTTTTTCTGA TTTTTCTTTC 900 

CAATTTTAGT CATGTTTCCT CTCGGCTGTA TGTAAAATTA AAAAAACTCA 95 0 

TCTAATACTT TCTTCATTAC AGACTCTAAA CCGCAAATTC GCATTTCTAT 10 00 

AAACTTCATC CCAAAGCATC ATGCGTTTTA GTTCCAAAAT ATTTTTCATT 105 0 

TCAAAATTTT CTTTTCCCAT GTCCAAATAT TTTGGTTCTT CGTCGTTGTA 1100 

35 TGGTTCCCAA AAAATGTCGT TTAAATGTCT TTTTGTATCA C ATG ATG CAT 1150 

CTTCAGTGTT TGATGTAGGT TTTCCATTCT TTACAAAATT GGTCCAGATC 12 00 

CTGACCATCC TTTCCAGAGT CTGCATAGCA TCTTGCGGTA TAGAAATGTG 12 50 

CGTTGTTCCC AAAATAGGAA CATC C AACGA GTTTGCAAAA AGATATCCTA 13 00 

AATCAT^AfC ATGACTAACT CCTGGCAAGG AACCCCATCG VTAGGGCAAG 13 50 

40 ATATAACTTT TATATACACT ATACGTATCG TCTGATAAL.C TGTACAAATA 14 00 

CAAATTGGAA TTCCTTCTGA ATTCATTGAA AAATATGGCT TTCAAGGTAC 14 50 

GGTAGATTCC TTGGAGGTAA CCAGCATCGC CTTTTAGTTG GACGTAGGCT 1500 

TCAATGTCAT CGTTTTGTTC TTTCACTTGG TCATAAAACT CTGTAAGCAT 1550 

TTTTATTAGT TTTTTCTCCT CAATACCATT TGGATTTTTC CAAATATTAG 16 00 

45 CATCTCTAGG TATAGCCCTT TCTATATTTG TCAAAGTCTT GTGAACTTCC 165 0 

ATGTTTCCTC TGGTTACTCT TG C CAT AG AT CTTAAACCTT CAGCACTATT 17 00 

GAAACCAATG ATGACATCGA CATCAGGAAA TTCACCATTT CTCATTCTTT 17 50 

CCAAGGCAGA TGGAAATGTT ATTGGATCTG GAGATTCTGG TGATTCTAAG 18 0 0 
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ACTGGATTAA ATACTGGTAC TGTGTCTCGG TCATGGTCGT TGCTATCAGG 1850 

AACTCTGTTT AAGGCTGCTC GTAAAATTAG GTTTTTATCC AGGTTTAGGA 190 0 

GTTCTTGTTT TTGCGTGATG TTTAGCACTT GTTTGAGTTT TTCAAATCTA 195 0 

TGCAGAAGTT GAATTTGATT AGCAGTCGGA TTTAGTAATG TCCCACTCTG 2 000 

5 CAAAATGGCC CTTTGGTAGT ATTTTCTAGT CGAGTTGTCC ATCATCAGAA 2 05 0 

AATGGACACT TGCTGCTCCA GCAGATTCTC CAGCAATTGT AATTTTTTCT 210 0 

CTGTCTCCAC CAAACTTTTC GATGTTGTCG TAAACCCATT TTAGTGCCAA 2150 

TCTCTGGTCT TTTAGACCCA TATTTCCATG GATATCCCAT TCCGGCGCTG 22 00 

ATAGAAAACC GAAAACTCCT AATCTATAGT TGATAGTGAC CAAAATAATT 22 5 0 

10 CCTTCCCTGA TCAAATAATC AGGTCCAAAA AAATTATAAG ATCCTGATCC 230 0 

TTGGTTGAAT GCGCCTCCAT GGATCCAGAA CATTACAGGA TATTTTGTAT 23 50 

TGTTCG C AG A ATTAACAGTC TCTGGCGTGA ATATATTCAG ATATAAGCAA 24 0 0 

TCTTCGCTTC CTGCATAAGA ATATATTAGA CTTTCCTGGA AACATTTGTC 24 50 

TCCCAAAGTT CGAGCCTGTA CGAATCCTGT TTTTGGATTT GATATTGGTT 2500 

15 TTGGAGACTG AAATCGTAAT GGTCCAAAAG GCGGTTCGGC ATAAGGAATT 255 0 

CCCAAATAAG AACAATATAC ATCATTCCTA TGATCTTTAT ATCGGAACGG 2600 

TTTTCCTTCC GTGATCCCGT TAAATTGAAC TCTGCACAAA TGCTGATCTA 2650 

GGTTATCCCA TAGTATGCAC AAGATAGGTG TAAATAAGAA AAATAAAAAA 27 00 

AATAAAAATA AAACTAATGC ACTACTGTGA GGTAACATTT TTTATTGTGT 2750 

20 TTTTTAATGC ATTTTGGTTG CTTAATTGTT ATTATTTATC TCGTTTTGTT 2 800 

TATGATAAAA TAGACGTTTT GAAGACGACA TGTCTA 2 83 6 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1710 nucleotides 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

30 (A) NAME /KEY : CDS 

(B) LOCATION: 1..1710 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

TGG GAT AAC CTA GAT CAG CAT TTG TGC AGA GTT CAA TTT AAC 42 
Trp Asp Asn Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn 

35 i 5 io 

GGG ATC ACG GAA GGA AAA CCG TTC CGA TAT AAA GAT CAT AGG 84 
Gly He Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Arg 
15 20 25 

AAT GAT GTA TAT TGT TCT TAT TTG GGA ATT CCT TAT GCC GAA 12 6 

40 Asn Asp Val Tyr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu 
30 35 40 

CCG CCT TTT GGA CCA TTA CGA TTT CAG TCT CCA AAA CCA ATA 168 
Pro Pro Phe Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro He 
45 50 55 
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TCA AAT CCA AAA ACA GGA TTC GTA CAG GCT CGA ACT TTG GGA 210 
Ser Asn Pro Lys Thr Gly Phe Val Gin Ala Arg Thr Leu Gly 
60 65 70 

GAC AAA TGT TTC CAG GAA AGT CTA ATA TAT TCT TAT GCA GGA 2 52 

5 Asp Lys Cys Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly 

75 80 

AGC GAA GAT TGC TTA TAT CTG AAT ATA TTC ACG CCA GAG ACT 2 94 

Ser Glu Asp Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr 
85 90 95 

10 GTT AAT TCT GCG AAC AAT ACA AAA TAT CCT GTA ATG TTC TGG 3 36 

Val Asn Ser Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp 
100 105 HO 

ATC CAT GGA GGC GCA TTC AAC CAA GGA TCA GGA TCT TAT AAT 37 8' 

He His Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn 
15 115 120 125 

TTT TTT GGA CCT GAT TAT TTG ATC AGG GAA GGA ATT ATT TTG 42 0 

Phe Phe Gly Pro Asp Tyr Leu He Arg Glu Gly He He Leu 
130 135 140 

GTC ACT ATC AAC TAT AGA TTA GGA GTT TTC GGT TTT CTA TCA . 4 62 
20 Val Thr He Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser 

145 150 

GCG CCG GAA TGG GAT ATC CAT GGA AAT ATG GGT CTA AAA GAC 5 04 

Ala Pro Glu Trp Asp He His Gly Asn Met Gly Leu Lys Asp 
155 160 165 

25 CAG AGA TTG GCA CTA AAA TGG GTT TAC GAC AAC ATC GAA AAG 54 6 

Gin Arg Leu Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys 
170 175 180 

TTT GGT GGA GAC AGA GAA AAA ATT ACA ATT GCT GGA GAA TCT 58 8 

Phe Gly Gly Asp Arg Glu Lys He Thr He Ala Gly Glu Ser 
30 185 * 190 195 

GCT GGA GCA GCA AGT GTC CAT TTT CTG ATG ATG GAC AAC TCG 63 0 

Ala Gly Ala Ala Ser Val His Phe Leu Met Met Asp Asn Ser 
200 205 210 

ACT AGA AAA TAC TAC CAA AGG GCC ATT TTG CAG AGT ^GC ACA 6 72 

35 Thr Arg Lys Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr 

215 220 

TTA CTA AAT CCG ACT GCT AAT CAA ATT CAA CTT CTG CAT AGA 714 
Leu Leu Asn Pro Thr Ala Asn Gin He Gin Leu Leu His Arg 
225 230 235 



40 TTT GAA AAA CTC AAA CAA GTG CTA AAC ATC ACG CAA AAA CAA 
Phe Glu Lys Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin 
240 245 250 



756 
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GAA CTC CTA AAC CTG GAT AAA AAC CTA ATT TTA CGA GCA GCC 7 98 

Glu Leu Leu Asn Leu Asp Lys Asn Leu lie Leu Arg Ala Ala 
255 260 265 

TTA AAC AGA GTT CCT GAT AGC AAC GAC CAT GAC CGA GAC ACA 84 0 

5 Leu Asn Arg Val Pro Asp Ser Asn Asp His Asp Arg Asp Thr 
270 275 280 

GTA CCA GTA TTT AAT CCA GTC TTA GAA TCA CCA GAA TCT CCA 8 82 

Val Pro Val Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro 
285 290 

10 GAT CCA ATA ACA TTT CCA TCT GCC TTG GAA AGA ATG AGA AAT 924 
Asp Pro lie Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn 
295 300 305 

GGT GAA TTT CCT GAT GTC GAT GTC ATC ATT GGT TTC AAT AGT 966 
Gly Glu Phe Pro Asp Val Asp Val He He Gly Phe Asn Ser 
15 310 315 320 

GCT GAA GGT TTA AGA TCT ATG GCA AGA GTA ACC AGA GGA AAC 10 0 8 

Ala Glu Gly Leu Arg Ser Met Ala Arg Val Thr Arg Gly Asn 
325 330 335 

ATG GAA GTT CAC AAG ACT TTG ACA AAT ATA GAA AGG GCT ATA 1050 
20 Met Glu Val His Lys Thr Leu Thr Asn He Glu Arg Ala He 
340 345 350 

CCT AGA GAT GCT AAT ATT TGG AAA AAT CCA AAT GGT ATT GAG 1092 
Pro Arg Asp Ala Asn He Trp Lys Asn Pro Asn Gly He Glu 
355 360 

25 GAG AAA AAA CTA ATA AAA ATG CTT ACA GAG TTT TAT GAC CAA 1134 
Glu Lys Lys Leu He Lys Met Leu Thr Glu Phe Tyr Asp Gin 
365 370 375 

GTG AAA GAA CAA AAC GAT GAC ATT GAA GCC TAC GTC CAA CTA 1176 
Val Lys Glu Gin Asn Asp Asp He Glu Ala Tyr Val Gin Leu 
30 380 385 390 

AAA GGC GAT GCT GGT TAC CTC CAA GGA ATC TAC CGT ACC TTG 1218 
Lys Gly Asp Ala Gly Tyr Leu Gin Gly He Tyr Arg Thr Leu 
395 400 405 

AAA GCC ATA TTT TTC AAT GAA TTC AGA AGG AAT TCC AAT TTG 12 60 

35 Lys Ala He Phe Phe Asn ^ a Phe Arg Arg Asn Ser Asn Leu 
410 415 420 

TAT TTG TAC AGG TTA TCA GAC GAT ACG TAT AGT GTA TAT AAA 13 02 

Tyr Leu Tyr Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys 
425 430 

40 AGT TAT ATC TTG CCC TAT CGA TGG GGT TCC TTG CCA GGA GTT 134 4 

Ser Tyr He Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val 
435 440 445 
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AGT CAT GGT GAT GAT TTA GGA TAT CTT TTT GCA AAC TCG TTG 13 86 

Ser His Gly Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu 
450 455 460 

GAT GTT CCT ATT TTG GGA ACA ACG CAC ATT TCT ATA CCG CAA 142 8 

5 Asp Val Pro He Leu Gly Thr Thr His He Ser He Pro Gin 
465 470 475 

GAT GCT ATG CAG ACT CTG GAA AGG ATG GTC AGG ATC TGG ACC 14 7 0 

Asp Ala Met Gin Thr Leu Glu Arg Met Val Arg He Trp Thr 
480 485 490 

10 AAT TTT GTA AAG AAT GGA AAA CCT ACA TCA AAC ACT GAA GAT 1512 
Asn Phe Val Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp 
495 500 

GCA TCA TGT GAT ACA AAA AG A CAT TTA AAC GAC ATT TTT TGG 15 54 

Ala Ser Cys Asp Thr Lys Arg His Leu Asn Asp He Phe Trp 
15 505 510 515 

GAA CCA TAC AAC GAC GAA GAA CCA AAA TAT TTG GAC ATG GGA 15 96 

Glu Pro Tyr Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly 
520 525 530 

AAA GAA AAT TTT GAA ATG AAA AAT ATT TTG GAA CTA AAA CGC 163 8 

20 Lys Glu Asn Phe Glu Met Lys Asn He Leu Glu Leu Lys Arg 
535 540 545 

ATG ATG CTT TGG GAT GAA GTT TAT AGA AAT GCG AAT TTG CGG 1680 
Met Met Leu Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg 
550 555 560 

25 TTT AGA GTC TGT AAT GAA GAA AGT ATT AGA 1710 
Phe Arg Val Cys Asn Glu Glu Ser He Arg 
565 570 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS : 

30 (A) LENGTH: 1788 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

35 (-i ) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1788 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ATG TTA CCT CAC AGT AGT GCA TTA GTT TTA TTT TTA TTT TTT 4 2 

40 Met Leu Pro His Ser Ser Ala Leu Val Leu Phe Leu Phe Phe 
15 10 
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TTA TTT TTC TTA TTT ACA CCT ATC TTG TGC ATA CTA TGG GAT 84 
Leu Phe Phe Leu Phe Thr Pro He Leu Cys He Leu Trp Asp 
15 20 25 

AAC CTA GAT CAG CAT TTG TGC AGA GTT CAA TTT AAC GGG ATC 126 
5 Asn Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn Gly He 
30 35 40 

ACQ GAA GGA AAA CCG TTC CGA TAT AAA GAt CAT AGG AAT GAT 168 
Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Arg Asn Asp 
45 50 55 

10 GTA TAT TGT TCT TAT TTG GGA ATT CCT TAT GCC GAA CCG CCT 210 
Val Tyr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu Pro Pro 
60 65 70 

TTT GGA CCA TTA CGA TTT CAG TCT CCA AAA CCA ATA TCA AAT 252 
Phe Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro He Ser Asn 
15 75 80 

CCA AAA ACA GGA TTC GTA CAG GCT CGA ACT TTG GGA GAC AAA 2 94 

Pro Lys Thr Gly Phe Val Gin Ala Arg Thr Leu Gly Asp Lys 
85 9° 95 

TGT TTC CAG GAA AGT CTA ATA TAT TCT TAT GCA GGA AGC GAA 336 ■ 

20 Cys Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly Ser Glu 
100 105 HO 

GAT TGC TTA TAT CTG AAT ATA TTC ACG CCA GAG ACT GTT AAT 37 8 

Asp Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr Val Asn 
115 120 I 25 

25 TCT GCG AAC AAT ACA AAA TAT CCT GTA ATG TTC TGG ATC CAT 42 0 

Ser Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp He His 
130 135 140 

GGA GGC GCA TTC AAC CAA GGA TCA GGA TCT TAT AAT TTT TTT 462 
Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn Phe Phe 
30 145 150 

GGA CCT GAT TAT TTG ATC AGG GAA GGA ATT ATT TTG GTC ACT 504 
Gly Pro Asp Tyr Leu He Arg Glu Gly He He Leu Val Thr 
155 160 165 

ATC AAC TAT AGA TTA GGA GTT TTC GGT TTT CTA TCA GCG CCG 546 
35 He Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser Ala Pro 
170 175 180 

GAA TGG GAT ATC CAT GGA AAT ATG GGT CTA AAA GAC CAG AGA 588 
Glu Trp Asp He His Gly Asn Met Gly Leu Lys Asp Gin Arg 
185 190 195 

40 TTG GCA CTA AAA TGG GTT TAC GAC AAC ATC GAA AAG TTT GGT 630 
Leu Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys Phe Gly 
200 205 210 
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GGA GAC AGA GAA 
Gly Asp Arg Glu 



GCA GCA AGT GTC 
5 Ala Ala Ser Val 
225 

AAA TAC TAC CAA 
Lys Tyr Tyr Gin 
240 

10 AAT CCG ACT GCT 
Asn Pro Thr Ala 
255 

AAA CTC AAA CAA 
Lys Leu Lys Gin 
15 270 

CTA AAC CTG GAT 
Leu Asn Leu Asp 



AGA GTT CCT GAT 
20 Arg Val Pro Asp 
295 

GTA TTT AAT CCA 
Val Phe Asn Pro 
310 

25 ATA ACA TTT CCA 
lie Thr Phe Pro 
325 

TTT CCT GAT GTC 
Phe Pro Asp Val 
• 30 340 

GGT TTA AGA TCT 
Gly Leu Arg Ser 

GTT CAC AAG ACT 
35 Val His Lys Thr 
365 

GAT GCT AAT ATT 
Asp Ala Asn lie 
380 

40 AAA CTA ATA AAA 
Lys Leu lie Lys 
395 



AAA ATT ACA ATT GCT 
Lys lie Thr He Ala 
215 

CAT TTT CTG ATG ATG 
His Phe Leu Met Met 
230 

AGG GCC ATT TTG CAG 
Arg Ala He Leu Gin 
245 

AAT CAA ATT CAA CTT 
Asn Gin He Gin Leu 
260 

GTG CTA AAC ATC ACG 
Val Leu Asn He Thr 
275 

AAA AAC CTA ATT TTA 
Lys Asn Leu He Leu 
285 

AGC AAC GAC CAT GAC 
Ser Asn Asp His Asp 
300 

GTC TTA GAA TCA CCA 
Val Leu Glu Ser Pro 
315 

TCT GCC TTG GAA AGA 
Ser Ala Leu Glu Arg 
330 

GAT GTC ATC ATT GGT 
Asp Val He He Gly 
345 

ATG GCA AGA GTA ACC 
Met Ala Arg Val Thr 
355 

TTG ACA AAT ATA GAA 
Leu Thr Asn He Glu 
370 

TGG AAA AAT CCA AAT 
Trp Lys Asn Pro Asn 
385 

ATG CTT ACA GAG TTT 
Met Leu Thr Glu Phe 
400 



GGA GAA TCT GCT GGA 
Gly Glu Ser Ala Gly 
220 

GAC AAC TCG ACT AGA 
Asp Asn Ser Thr Arg 
235 

AGT GGG ACA TTA CTA 
Ser Gly Thr Leu Leu 
250 

CTG CAT AGA TTT GAA 
Leu His Arg Phe Glu 
265 

CAA AAA CAA GAA CTC 
Gin Lys Gin Glu Leu 
280 

CGA GCA GCC TTA AAC 
Arg Ala Ala Leu Asn 
290 

CGA GAC ACA GTA CCA 
Arg Asp Thr Val Pro 
305 

GAA TCT CCA GAT CCA 
Glu Ser Pro Asp Pro 
320 

ATG AGA AAT GGT GAA 
Met Arg Asn Gly Glu 
335 

TTC AAT AGT GCT GAA 
Phe Asn Ser Ala Glu 
350 

AGA GGA AAC ATG GAA 
Arg Gly Asn Met Glu 
360 

AGG C~T ATA CC7 AGA 
Arg Ala He ^ro Arg 
375 

GGT ATT GAG GAG AAA 
Gly He Glu Glu Lys 
390 

TAT GAC CAA GTG AAA 
Tyr Asp Gin Val Lys 
405 



FC-1-C1-PCT 

672 
714 
756 
798 

840 

882 

924 

966 

1008 

1050 

1092 

1134 

1176 
1218 
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GAA CAA AAC GAT GAC ATT GAA GCC TAC GTC CAA CTA AAA GGC 1260 
Glu Gin Asn Asp Asp lie Glu Ala Tyr Val Gin Leu Lys Gly 
41 P 415 420 

GAT GCT GGT TAC CTC CAA GGA ATC TAC CGT ACC TTG AAA GCC 13 02 

5 Asp Ala Gly Tyr Leu Gin Gly He Tyr Arg Thr Leu Lys Ala 

425 430 

ATA TTT TTC AAT GAA TTC AGA AGG AAT TCC AAT TTG TAT TTG 1344 
He Phe Phe Asn Glu Phe Arg Arg Asn Ser Asn Leu Tyr Leu 
435 440 445 

10 TAC AGG TTA TCA GAC GAT ACG TAT AGT GTA TAT AAA AGT TAT 13 86 

Tyr Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys Ser Tyr 
450 455 460 

ATC TTG CCC TAT CGA TGG GGT TCC TTG CCA GGA GTT AGT CAT 14 2 8 

He Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val Ser His 
15 465 * 470 475 

GGT GAT GAT TTA GGA TAT CTT TTT GCA AAC TCG TTG GAT GTT 14 7 0 

Gly Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu Asp Val 
480 485 490 

CCT ATT TTG GGA ACA ACG CAC ATT TCT ATA CCG CAA GAT GCT 1512 
20 Pro lie Leu Gly Thr Thr His He Ser He Pro Gin Asp Ala 

495 500 

ATG CAG ACT CTG GAA AGG ATG GTC AGG ATC TGG ACC AAT TTT 15 54 

Met Gin Thr Leu Glu Arg Met Val Arg He Trp Thr Asn Phe 
505 510 515 

25 GTA AAG AAT GGA AAA CCT ACA TCA AAC ACT GAA GAT GCA TCA 15 96 

Val Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp Ala Ser 
520 525 530 

TGT GAT ACA AAA AGA CAT TTA AAC GAC ATT TTT TGG GAA CCA 163 8 

Cys Asp Thr Lys Arg His Leu Asn Asp He Phe Trp Glu Pro 
30 *" 535 540 545 

TAC AAC GAC GAA GAA CCA AAA TAT TTG GAC ATG GGA AAA GAA 16 8 0 

Tyr Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly Lys Glu 
550 555 560 

AAT TTT GAA ATG ^ v AAT AT" TTG GAA CTA AAA CGC ATG ATG 17 2 2 

35 Asn Phe Glu Met Lys Asn Xle Leu Glu Leu Lys Arg Met Met 

565 570 

CTT TGG GAT GAA GTT TAT AGA AAT GCG AAT TTG CGG TTT AGA 17 64 

Leu Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg Phe Arg 
57 5 ~ ~ 580 585 



40 GTC TGT AAT GAA GAA AGT ATT AGA 
Val Cys Asn Glu Glu Ser He Arg 
590 595 



1788 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 nucleotides 

(B) TYPE: nucleic acid 
5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TCTAATACTT TCTTCATTAC AGACTCTAAA CCGCAAATTC GCATTTCTAT 5 0 

10 AAACTTCATC CCAAAGCATC ATGCGTTTTA GTTCCAAAAT ATTTTTCATT 100 
TCAAAATTTT CTTTTCCCAT GTCCAAATAT TTTGGTTCTT CGTCGTTGTA 150 
TGGTTCCCAA AAAATGTCGT TTAAATGTCT TTTTGTATCA CATGATGCAT 
CTTCAGTGTT TGATGTAGGT TTTCCATTCT TTACAAAATT GGTCCAGATC 
CTGACCATCC TTTCCAGAGT CTGCATAGCA TCTTGCGGTA TAGAAATGTG 
15 CGTTGTTCCC AAAATAGGAA CATCCAACGA GTTTGCAAAA AGATATCCTA 
AATCATCACC ATGACTAACT CCTGGCAAGG AACCCCATCG ATAGGGCAAG 
ATATAACTTT TATATACACT ATACGTATCG TCTGATAACC TGTACAAATA 
CAAATTGGAA TTCCTTCTGA ATTCATTGAA AAATATGGCT TTCAAGGTAC 
GGTAGATTCC TTGGAGGTAA CCAGCATCGC CTTTTAGTTG GACGTAGGCT 
20 TCAATGTCAT CGTTTTGTTC TTTCACTTGG TCATAAAACT CTGTAAGCAT 
TTTTATTAGT TTTTTCTCCT CAATACCATT TGGATTTTTC CAAATATTAG 
CATCTCTAGG TATAGCCCTT TCTATATTTG TCAAAGTCTT GTGAACTTCC 
ATGTTTCCTC TGGTTACTCT TG C CAT AG AT CTTAAACCTT CAGCACTATT 
GAAACCAATG ATGACATCGA CATCAGGAAA TTCACCATTT CTCATTCTTT 
25 CCAAGGCAGA TGGAAATGTT ATTGGATCTG GAGATTCTGG TGATTCTAAG 
ACTGGATTAA ATACTGGTAC TGTGTCTCGG TCATGGTCGT TGCTATCAGG 
AACTCTGTTT AAGGCTGCTC GTAAAATTAG GTTTTTATCC AGGTTTAGGA 
GTTCTTGTTT TTGCGTGATG TTTAGCACTT GTTTGAGTTT TTCAAATCTA 
TGCAGAAGTT GAATTTGATT AGCAGTCGGA TTTAGTAATG TCCCACTCTG 
30 CAAAATGGCC CTTTGGTAGT ATTTTCTAGT CGAGTTGTCC ATCATCAGAA 
AATGGACACT TGCTGCTCCA GCAGATTCTC CAGCAATTGT AATTTTTTCT 
CTGTCTCCAC CAAACTTTTC GATGTTGTCG TAAACCCATT TTAGTGCCAA 
TCTCTGGTCT TTTAGACCCA TATTTCCATG GATATCCCAT TCCGGCGCTG 
AT AG AAAAC C GAAAACTCCT AATCTATAGT TGATAGTGAC CAAAATAATT 
35 CCTTCCCTGA TCAAATAATC AGGTCCAAAA AAATTATAAG ATCCTGATCC 13 50 

TTGGTTGAAT GCGCCTCCAT GGATCCAGAA CATTACAGGA TATTTTGTAT 14nn 
TGTTCGCAGA ATTAACAGTC TCTGGCGTGA ATATATTCAG ATATAAGCAA 
TCTTCGCTTC CTGCATAAGA ATATATTAGA CTTTCCTGGA AACATTTGTC 
TCCCAAAGTT CGAGCCTGTA CGAATCCTGT TTTTGGATTT GATATTGGTT 
40 TTGGAGACTG AAATCGTAAT GGTC C AAAAG GCGGTTCGGC ATAAGGAATT 
CCCAAATAAG AACAATATAC ATCATTCCTA TGATCTTTAT ATCGGAACGG 
T TTCCTTCC GTGATCCCGT TAAATTGAAC TCTGCACAAA TGCTGATCTA 
GGTTATCCCA TAGTATGCAC AAGATAGGTG TAAATAAGAA AAATAAAAAA 
AATAAAAATA AAACTAATGC ACTACTGTGA GGTAACAT 

45 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2801 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



200 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

850 

900 

950 

1000 

1050 

1100 

1150 

1200 

. 1250 
1300 



1400 
1450 
1500 
1550 
1600 
1650 
1700 
1750 
1788 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

(A) NAME /KEY : CDS 
5 (B) LOCATION: 99.. 1886 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GACATGTCGT CTTCAAAACG TCTATTTTAT CATAAACAAA ACGAGATAAA 5 0 

TAATAACAAT TAAGCATCCA AAATGCATTA AAAAAAACAT CATAAAAA 9 8 

ATG TTA CCT CAC AGT GCA TTA GTT TTA TTT TTA TTT TTT TTA 14 0 

10 Met Leu Pro His Ser Ala Leu Val Leu Phe Leu Phe Phe Leu 
1 5 10 

TTT TTC TTA TTT ACA CCT GTC TTG TGC ATA CTA TGG GAT AAC 182 
Phe Phe Leu Phe Thr Pro Val Leu Cys He Leu Trp Asp Asn 
15 20 25 

15 CTA GAT CAG CAT TTG TGC AGA GTT CAA TTT AAC GGG ATC ACG 224 
Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn Gly He Thr 
30 35 40 

GAA GGA AAA CCG TTC CGA TAT AAA GAT CAT AAA AAT GAT GTA 266 
Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Lys Asn Asp Val 
20 45 50 55 

TAT TGT TCC TAT TTG GGA ATT • CCT TAT GCA GAA CCG CCT ATT 308 
Tyr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu Pro Pro He 
60 65 70 

GGA CCA TTG CGA TTT CAG TCT CCA AAA CCA ATA TCA AAT CCA 350 
25 Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro He Ser Asn Pro 

75 8° 

AAA ACA GGA TTC GTT CAG GCT CGG TCT TTA GGA GAC AAA TGT 3 92 

Lys Thr Gly Phe Val Gin Ala Arg Ser Leu Gly Asp Lys Cys 
85 9° 95 

30 TTC CAG GAA AGT CTA ATA TAT TCT TAT GCA GGA AGC GAA GAT 4 34 

Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly Ser Glu Asp 
100 105 11° 

TGC TTA TAT CTG AAT ATA TTC ACG CCA GAG ACT GTT AAT TCT 4 76 

Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr Val Asn Ser 
35 * H5 120 125 



GCG AAC AAT ACA AAA TAT CCT GTA ATG TTC TGG ATC CAT GGA 
Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp He His Gly 
130 135 140 

GGC GCA TTC AAC CAA GGA TCA GGA TCT TAT AAT TTT TTT GGA 
40 Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn Phe Phe Gly 

145 150 



518 



560 
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CCT GAT TAT TTG ATC AGG GAA GGA ATT ATT TTG GTC ACT ATC 602 
Pro Asp Tyr Leu lie Arg Glu Gly He He Leu Val Thr He 
155 160 165 

AAC TAT AG A TTA GGA GTT TTC GGT TTT CTA TCA GCG CCG GAA 644 
5 Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser Ala Pro Glu 
170 175 180 

TGG GAT ATC CAT GGA AAT ATG GGT CTA AAA GAC CAG AGA TTG 686 
Trp Asp He His Gly Asn Met Gly Leu Lys Asp Gin Arg Leu 
185 190 195 

10 GCA CTA AAA TGG GTT TAT GAC AAC ATC GAA AAA TTT GGT GGA 72 8 

Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys Phe Gly Gly 
200 205 210 

GAC AGA GAT AAA ATC ACT ATA GCT GGA GAA TCT GCT GGA GCA 77 0 

Asp Arg Asp Lys He Thr He Ala Gly Glu Ser Ala Gly Ala 
15 - 220 

GCA AGT GTT CAT TTT CTG ATG ATG GAC AAT TCT ACT AGA AAA 812 
Ala Ser Val His Phe Leu Met Met Asp Asn Ser Thr Arg Lys 
225 230 235 

TAC TAC CAA AGG GCA ATT TTG CAG AGT GGG ACA TTA CTC AAT 8 54 

20 Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr Leu Leu Asn 
240 245 250 

CCG ACT GCT AAT CAA ATT CAA CCT CTG CAT AGA TTT GAA AAA 896 
Pro Thr Ala Asn Gin He Gin Pro Leu His Arg Phe Glu Lys 
255 260 265 

25 CTA AAA CAA GTG CTG AAC ATC ACG CAA AAA CAA GAA CTC CTA 93 8 

Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin Glu Leu Leu 
270 275 280 

AAT CTG GAC AAA AAT CAA ATT TTG CGA GCA GCC TTA AAC AGA 980 
Asn Leu Asp Lys Asn Gin He Leu Arg Ala Ala Leu Asn Arg 
30 285 290 

GTC CCA GAT AAC AAC GAC CAC GAA AGG GAC ACA GTA CCA GTA 102 2 

Val Pro Asp Asn Asn Asp His Glu Arg Asp Thr Val Pro Val 
295 * 300 305 

TTT AAT CCA GTC CTA GAA TCA CCA GPA JCT CCA '5AC CCA ATA 1064 
35 Phe Asn Pro Val Leu Glu Ser Pro Glu Ser P„o Asp Pro He 
310 315 320 

ACA TTT CCA TCT GCT TTA GAA AGA ATG AGA AAT GGT GAA TTT 1106 
Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn Gly Glu Phe 
325 330 335 

40 CCT GAC GTT GAT GTC ATC ATT GGA TTC AAT AGT GCT GAA GGT 114 8 

Pro Asp Val Asp Val He He Gly Phe Asn Ser Ala Glu Gly 
340 345 350 
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TTA AGA 
Leu Arg 



TAC AAG 
5 Tyr Lys 
365 

GCT AAT 
Ala Asn 
380 

10 CTT ATA 
Leu lie 



CAA AAC 
Gin Asn 

15 

GCT GGT 
Ala Gly 



TTT TTC 
20 Phe Phe 
435 

AGG TTA 
Arg Leu 
450 

25 TTG CCC 
Leu Pro 



GAT GAT 
Asp Asp 

30 

ATT TTG 
lie Leu 



CAG ACT 
35 Gin Thr 
505 

AAG AAT 
Lys Asn 
520 

40 GAT ACA 
Asp Thr 



TCT ATG CCA 
Ser Met Pro 
355 

ACT TTG ACA 
Thr Leu Thr 



ATT TGG AAA 
lie Trp Lys 



AAA ATG CTT 
Lys Met Leu 
395 

GAT GAC ATC 
Asp Asp He 
410 

TAT CTC CAA 
Tyr Leu Gin 
425 

AAT GAA ATC 
Asn Glu He 



TCA GAT GAT 
Ser Asp Asp 



TAT CGA TGG 
Tyr Arg Trp 
465 

TTA GGA TAT 
Leu Gly Tyr 
480 

GGA ACA ACG 
Gly Thr Thr 
495 

CTG GAA AGG 
Leu Glu Arc, 



GGA AAA CCT 

Gly Lys Pro 



AAA AGA CAT 
Lys Arg His 
535 



AGA GTA ACC 
Arg Val Thr 



AAT ATA GAG 
Asn He Glu 
370 

AAT CCT AAT 
Asn Pro Asn 
385 

ACA GAG TTT 
Thr Glu Phe 
400 

GAA GCC TAT 
Glu Ala Tyr 



GGA ATT TAC 
Gly He Tyr 



AAA AGA AAT 
Lys Arg Asn 
440 

ACG TAT AGT 
Thr Tyr Ser 
455 

GGT TCC TTG 
Gly Ser Leu 
470 

CTT TTT GCA 
Leu Phe Ala 



CAC ATT TCT 
His He Ser 



ATG GTC AGG 
Met Val Arg 
510 

ACA TCA AAC 
Thr Ser Asn 
525 

TTA AAC GAC 
Leu Asn Asp 
540 



AGA GGA AAC 
Arg Gly Asn 
360 

AGA GCT ATA 
Arg Ala He 
375 

GGC ATT GAG 
Gly He Glu 



TAT GAC CAA 
Tyr Asp Gin 



GTC CAA CTA 
Val Gin Leu 
415 

CGT ACC TTG 
Arg Thr Leu 
430 

TCC AAC TTG 
Ser Asn Leu 
445 

GTA TAT AAA 
Val Tyr Lys 



CCA GGA GTT 
Pro Gly Val 

AAC TCT TTG 
Asn Ser Leu 
485 

ATA CCG CAA 
He Pro Gin 
500 

ATC TGG ACC 
He Trp Thr 
515 

ACT GAA GAT 
Thr Glu Asp 

ATT TTT TGG 
He Phe Trp 



ATG GAA GTT 
Met Glu Val 



CCT AGA GAT 
Pro Arg Asp 

GAG AAA AAA 
Glu Lys Lys 
390 

GTT AAA GAA 
Val Lys Glu 
405 

AAA GGC GAT 
Lys Gly Asp 
420 

AAA GCC ATA 
Lys Ala He 



TAT TTG TAT 
Tyr Leu Tyr 



AGT TAT ATC 
Ser Tyr He 
460 

AGT CAT GGT 
Ser His Gly 
475 

GAT GTT CCT 
Asp Val Pro 
490 

GAT GCT ATG 
Asp Ala Met 



AAT TTT GTA 
Asn Phe Val 



GCA TCA TGT 
Ala Ser Cys 
530 

GAA CCA TAC 
Glu Pro Tyr 
545 



1232 



1274 



1316 



1358 



1400 



1442 



1484 



1526 



1568 



1610 



1652 



1694 



1736 
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AAC GAC GAA GAA CCA AAA TAT TTG GAC ATG GGA AAA GAA CAT 177 8 

Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly Lys Glu His 
550 555 560 

TTT GAA ATG AAA AAT ATT TTG GAA CTA AAA CGC ATG ATG CTT 182 0 

5 Phe Glu Met Lys Asn lie Leu Glu Leu Lys Arg Met Met Leu 

565 570 

TGG GAT GAA GTT TAT AGA AAT GCG AAT TTG CGG TTT AGA GTC 1862 
Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg Phe Arg Val 
575 580 585 

10 TGT AAT GAA GAA AGT ATT AGA TGA GTTTTTTTAA TTTTACATAC 1906 
Cys Asn Glu Glu Ser lie Arg 
590 595 

AGCCGAGAGG AAACATGACT AAAATTGGAA AGAAAAATCA GAAAAAGAAA 195 6 

AATCACATGG AC CAT AGT AA CTTTATTACA TGATTTAGTT TCAAGTGTAT 2 00 6 

15 CAAGAAAACT TATTGCATCA AAGAAAATAT TATTTTGCCA AAATTCTTGG 2 05 6 

AAAAACACTT TTTATGACTG ACATGGCCCA TAATTGAAGC TTTTTCTTCT 210 6 

TTTACCAAAT CGCCAAATTT TGTAGCGTCA GACACATTTA TTTATGACAT 215 6 

GGCAATTAAT GTGTTAAACA TTCAACTCTA TATTAAAAAT GGTAGTATTT 2 2 06 

TCTAATAAGA AGGTTATATA AAAAGACTTG AAAATAATAA GATTTGCTCG 2256 

20 GCTATATATA AAAACTTANC GTCTCGTTAT GCTAAACTTT TTTGATGGTA 23 06 

AAAATATGTT GATTTTCCTA ATAATCTAAG ATATTATATT TTAGATTAAA 2 35 6 

TTAAAATATG ATATTTTCAA TTAATTAATT TTAGTTTTAA ATGTACTATA 24 06 

TTT AC C AGT A CTATGAAACT ATTTTAAATA TATTTTTTAT TACAATATTT 2 45 6 

ATTTCTCAAA AATGTTTAGT GTAACAAGAC CATTAAATTA GAGTTAATGT 2 50 6 

25 TGTAAATTAA ACTATTTTTT ATC TATC AC A ACCGCTTAAT TGGTGCAAAG 255 6 

AAAAATTTTA CTGTGATAAT ATTTGACATT TACACAATAT TACGAATTGT 2 60 6 

AAACTCACAA TTATGTGAAT ATTGTTTTTT GTTAAAAAAA CATACATGAC 2 65 6 

TTTTCTATAT CATTTTATAT TACGGTGATA TGGATTAATG TCAACATGTA 27 06 

AAATACAAAT GCGGTTGTTA AAAATAATCT GTATTAAAAT TGTTATATAA 2 75 6 

30 AATCTGAATA AATGTACTTT TAAGTAAAAA AAAAAAAAAA AAAAA 2 801 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 95 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Leu Pro His Ser Ala Leu Val Leu Phe Leu Phe Phe Leu 
15 10 

40 Phe Phe Leu Phe Thr Pro Val Leu Cys lie Leu Trp Asp Asn 
15 20 25 

Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn Gly He Thr 
30 35 40 



Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Lys Asn Asp Val 
45 45 50 55 
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Tvr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu Pro Pro He 
60 65 70 

Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro He Ser Asn Pro 
75 80 

5 Lys Thr Gly Phe Val Gin Ala Arg Ser Leu Gly Asp Lys Cys 
85 90 95 

Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly Ser Glu Asp 
100 105 HO 

Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr Val Asn Ser 
10 ' H5 120 125 

Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp He His Gly 
130 135 140 

Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn Phe Phe Gly 
145 150 

15 Pro Asp Tyr Leu He Arg Glu Gly He He Leu Val Thr He 
155 " 160 165 

Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser Ala Pro Glu 
170 ~ 175 180 

Trp Asp He His Gly Asn Met Gly Leu Lys Asp Gin Arg Leu 
20 185 190 195 

Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys Phe Gly Gly 
200 205 210 

Asp Arg Asp Lys He Thr He Ala Gly Glu Ser Ala Gly Ala 
215 22 0 

25 Ala Ser Val His Phe Leu Met Met Asp Asn Ser Thr Arg Lys 
225 230 235 

Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr Leu Leu Asn 
240 245 250 

Pro Thr Ala Asn Gin He Gin Pro Leu His Arg Phe Glu Lys 
30 255 260 265 

Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin Glu Leu Leu 
270 275 280 

Asn Leu Asp Lys Asn Gin He Leu Arg Ala Ala Leu Asn Arg 
285 290 

35 Val Pro Asp Asn Asn Asp His Glu Arg Asp Thr Val Pro Val 
295 300 305 

Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro Asp Pro He 
310 315 320 
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Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn Gly Glu Phe 
325 330 335 

Pro Asp Val Asp Val lie He Gly Phe Asn Ser Ala Glu Gly 
340 345 350 

5 Leu Arg Ser Met Pro Arg Val Thr Arg Gly Asn Met Glu Val 

355 360 

Tyr Lys Thr Leu Thr Asn He Glu Arg Ala He Pro Arg Asp 
365 370 375 

10 Ala Asn He Trp Lys Asn Pro Asn Gly He Glu Glu Lys Lys 
380 385 390 

Leu He Lys Met Leu Thr Glu Phe Tyr Asp Gin Val Lys Glu 
395 400 405 

Gin Asn Asp Asp He Glu Ala Tyr Val Gin Leu Lys Gly Asp 
15 410 415 420 

Ala Gly Tyr Leu Gin Gly He Tyr Arg Thr Leu Lys Ala He 
425 430 

Phe Phe Asn Glu He Lys Arg Asn Ser Asn Leu Tyr Leu Tyr 
435 440 445 

20 Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys Ser Tyr He 
. 450 455 460 

Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val Ser His Gly 
465 470 475 

Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu Asp Val Pro 
25 480 485 490 

He Leu Gly Thr Thr His He Ser He Pro Gin Asp Ala Met 
495 500 

Gin Thr Leu Glu Arg Met Val Arg He Trp Thr Asn Phe Val 
505 510 515 

30 Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp Ala Ser Cys 
520 525 530 

Asp Thr Lys Arg His Leu Asn Asp He r'he Trp Glu Pro Tyr 
535 545 545 

Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly Lys Glu His 
35 550 555 560 

Phe Glu Met Lys Asn He Leu Glu Leu Lys Arg Met Met Leu 
565 570 

Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg Phe Arg Val 
575 580 585 
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Cys Asn Glu Glu Ser lie Arg 
590 595 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

5 (A) LENGTH: 2 801 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 (iii) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TTTTTTTTTT TTTTTTTTTT ACTTAAAAGT ACATTTATTC AGATTTTATA 5 0 

TAACAATTTT AATACAGATT ATTTTTAACA ACCGCATTTG TATTTTACAT 100 

GTTGACATTA ATCCATATCA CCGTAATATA AAATGATATA GAAAAGTCAT 150 

GTATGTTTTT TTAACAAAAA ACAATATTCA CATAATTGTG AGTTTACAAT 2 00 

15 TCGTAATATT GTGTAAATGT CAAATATTAT CACAGTAAAA TTTTTCTTTG 2 50 

CACCAATTAA GCGGTTGTGA TAGATAAAAA ATAGTTTAAT TTACAACATT 3 00 

AACTCTAATT TAATGGTCTT GTTACACTAA ACATTTTTGA GAAATAAATA 350 

TTGTAATAAA AAATATATTT AAAATAGTTT CATAGTACTG GTAAATATAG 4 00 

TACATTTAAA ACTAAAATTA ATTAATTGAA AATATCATAT TTTAATTTAA 45 0 

20 TCTAAAATAT AATATCTTAG ATTATTAGGA AAATCAACAT ATTTTTACCA 5 00 

XCAAAAAAGT TTAGCATAAC GAGACGNTAA GTTTTTATAT ATAGCCGAGC 55 0 

AAATCTTATT ATTTTCAAGT CTTTTTATAT AACCTTCTTA TTAGAAAATA 600 

CTACCATTTT TAATATAGAG TTGAATGTTT AACACATTAA TTGCCATGTC 65 0 

ATAAATAAAT GTGTCTGACG CTACAAAATT TGGCGATTTG GTAAAAGAAG 700 

25 AAAAAGCTTC AATTATGGGC CATGTCAGTC ATAAAAAGTG TTTTTCCAAG 75 0 

AATTTTGGCA AAATAATATT TTCTTTGATG CAATAAGTTT TCTTGATACA 800 

CTTGAAACTA AATCATGTAA TAAAGTTACT ATGGTCCATG TGATTTTTCT 85 0 

TTTTCTGATT TTTCTTTCCA ATTTTAGTCA TGTTTCCTCT CGGCTGTATG 90 0 

TAAAATTAAA AAAACTCATC TAATACTTTC TTCATTACAG ACTCTAAACC 950 

30 GCAAATTCGC ATTTCTATAA ACTTCATCCC AAAGCATCAT GCGTTTTAGT 10 00 

TCCAAAATAT TTTTCATTTC AAAATGTTCT TTTCCCATGT CCAAATATTT 10 50 

TGGTTCTTCG TCGTTGTATG GTTCCCAAAA AATGTCGTTT AAATGTCTTT 1100 

TTGTATCACA TGATGCATCT TCAGTGTTTG ATGTAGGTTT TCCATTCTTT 1150 

ACAAAATTGG TCCAGATCCT GACCATCCTT TCCAGAGTCT GCATAGCATC 12 00 
35 TTGCGGTATA GAAATGTGCG TTGTTCCCAA AATAGGAACA TCCAAAGAGT 12 5 0 

TTGCAAAAAG ATATCCTAAA TCATCACCAT GACTAACTCC TGGCAAGGAA 13 00 

CCCCATCGAT AGGGCAAGAT ATAACTTTTA TATACACTAT ACGTATCATC 13 50 

TGATAACCTA TACAAATACA AGTTGGAATT TCTTTTGATT TCATTGAAAA 14 00 

ATATGGCTTT CAAGGTACGG TAAATTCCTT GGAGATAACC AGCATCGCCT 14 5 0 

40 TTT7.GTTGGA C VTAGGCTTC GATGTCATCG TTTTGTTCTT TAACTTGGTC 15 0 0 

ATAAAACTC'i OTAAGCATTT TTATAAGTTT TTTCTCCTCA ATGCCATTAG 1550 
GATTTTTCCA AAT ATT AG C A TCTCTAGGTA TAGCTCTCTC TATATTTGTC 1600 
AAAGTCTTGT AAACTTCCAT GTTTCCTCTG GTTACTCTTG GCATAGATCT 1650 
TAAACCTTCA GCACTATTGA ATCCAATGAT GACATCAACG TCAGGAAATT 17 0 0 

45 CACCATTTCT CATTCTTTCT AAAGCAGATG GAAATGTTAT TGGGTCTGGA 17 5 0 

GATTCTGGTG ATTCTAGGAC TGGATTAAAT ACTGGTACTG TGTCCCTTTC 18 00 

GTGGTCGTTG TTATCTGGGA CTCTGTTTAA GGCTGCTCGC AAAATTTGAT 18 5 0 

TTTTGTCCAG ATTTAGGAGT TCTTGTTTTT GCGTGATGTT CAGCACTTGT 190 0 

TTTAGTTTTT CAAATCTATG CAGAGGTTGA ATTTGATTAG CAGTCGGATT 1950 
50 GAGTAATGTC CCACTCTGCA AAATTGCCCT TTGGTAGTAT TTTCTAGTAG 2 000 

AATTGTCCAT CATCAGAAAA TGAACACTTG CTGCTCCAGC AGATTCTCCA 2 05 0 

GCTATAGTGA TTTTATCTCT GTCTCCACCA AATTTTTCGA TGTTGTCATA 210 0 
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AACCCATTTT AGTGCCAATC TCTGGTCTTT TAGACCCATA TTTCCATGGA 2150 

TATCCCATTC CGGCGCTGAT AGAAAACCGA AAACTCCTAA TCTATAGTTG 22 00 

ATAGTGACCA AAATAATTCC TTCCCTGATC AAATAATCAG GTCCAAAAAA 2250 

ATTATAAGAT CCTGATCCTT GGTTGAATGC GCCTCCATGG ATCCAGAACA 2300 

5 TTACAGGATA TTTTGTATTG TTCGCAGAAT TAACAGTCTC TGGCGTGAAT 2350 

ATATTCAGAT ATAAGCAATC TTCGCTTCCT GCATAAGAAT ATATTAGACT 24 00 

TTCCTGGAAA CATTTGTCTC CTAAAGACCG AGCCTGAACG AATCCTGTTT 24 50 

TTGGATTTGA TATTGGTTTT GGAGACTGAA ATCGCAATGG TCCAATAGGC 2 500 

GGTTCTGCAT AAGGAATTCC CAAATAGGAA CAATATACAT CATTTTTATG 2550 

10 ATCTTTATAT CGGAACGGTT TTCCTTCCGT GATCCCGTTA AATTGAACTC 2 600 

TGCACAAATG CTGATCTAGG TTATC C CAT A GTATGCACAA GACAGGTGTA 265 0 

AATAAGAAAA ATAAAAAAAA TAAAAATAAA ACTAATGCAC TGTGAGGTAA 27 00 

CATTTTTTAT GATGTTTTTT TTAATGCATT TTGGATGCTT AATTGTTATT 2 750 

ATTTATCTCG TTTTGTTTAT GATAAAATAG ACGTTTTGAA GACGACATGT 2 800 

15 C 2801 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1710 nucleotides 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

(A) NAME/ KEY: CDS 
25 (B) LOCATION: 1..1710 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TGG GAT AAC CTA GAT CAG CAT TTG TGC AGA GTT CAA TTT AAC 4 2 

Trp Asp Asn Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn 
1 5 10 

30 GGG ATC ACG GAA GGA AAA CCG TTC CGA TAT AAA GAT CAT AAA 84 
Gly lie Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Lys 
15 20 25 

AAT GAT GTA TAT TGT TCC TAT TTG GGA ATT CCT TAT GCA GAA 12 6 

Asn Asp Val Tyr Cys Ser Tyr Leu Gly lie Pro Tyr Ala Glu 
35 30 35 40 

CCG CCT ATT GGA CCA TTG CGA TTT CAG TCT CCA AAA CCA ATA 16 8 

Pro Pro lie Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro lie 
45 50 55 

TCA AAT CCA AAA AC A GGA TTC GTT CAG GCT CGG TCT TTA GGA 210 
40 Ser Asn Pro Lys Thr Gly Phe Val Gin Ala Arg Ser Leu- Gly 
60 65 70 

GAC AAA TGT TTC CAG GAA AGT CTA ATA TAT TCT TAT GCA GGA 2 52 

Asp Lys Cys Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly 
75 80 
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294 



336 



378 



420 



462 



504 



AGC GAA GAT TGC TTA TAT CTG AAT ATA TTC ACG CCA GAG ACT 
Ser Glu Asp Cys Leu Tyr Leu Asn lie Phe Thr Pro Glu Thr 
85 90 95 

GTT AAT TCT GCG AAC AAT ACA AAA TAT CCT GTA ATG TTC TGG 
5 val Asn Ser Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp 
100 105 110 

ATC CAT GGA GGC GCA TTC AAC CAA GGA TCA GGA TCT TAT AAT 
He His Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn 
115 120 125 

10 TTT TTT GGA CCT GAT TAT TTG ATC AGG GAA GGA ATT ATT TTG 
Phe Phe Gly Pro Asp Tyr Leu He Arg Glu Gly lie lie Leu 
130 135 140 

GTC ACT ATC AAC TAT AGA TTA GGA GTT TTC GGT TTT CTA TCA 
Val Thr He Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser 
15 145 150 

GCG CCG GAA TGG GAT ATC CAT GGA AAT ATG GGT CTA AAA GAC 
Ala Pro Glu Trp Asp He His Gly Asn Met Gly Leu Lys Asp 
155 160 165 

CAG AGA TTG GCA CTA AAA TGG GTT TAT GAC AAC ATC GAA AAA 
20 Gin Arg Leu Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys 
170 175 130 

TTT GGT GGA GAC AGA GAT AAA ATC ACT ATA GCT GGA GAA TCT 
Phe Gly Gly Asp Arg Asp Lys He Thr He Ala Gly Glu Ser 
185 190 19 5 

25 GCT GGA GCA GCA ACT GTT CAT TTT CTG ATG ATG GAC AAT TCT 
Ala Gly Ala Ala Ser Val His Phe Leu Met Met Asp Asn Ser 
200 205 210 

ACT AGA AAA TAC TAC CAA AGG GCA ATT TTG CAG ACT GGG ACA 
Thr Arg Lys Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr 
30 215 220 

TTA CTC AAT CCG ACT GCT AAT CAA ATT CAA CCT CTG CAT AGA 
Leu Leu Asn Pro Thr Ala Asn Gin He Gin Pro Leu His Arg 
225 230 235 

TTT GAA AAA CTA AAA CAA GTG CTG AAC ATC ACG C?A AAA CA7. 7 56 

35 Phe Glu Lys Leu Lys Gin Val Leu Asn He Thr Gin Lys J. n 
240 245 250 

GAA CTC CTA AAT CTG GAC AAA AAT CAA ATT TTG CGA GCA GCC 
Glu Leu Leu Asn Leu Asp Lys Asn Gin He Leu Arg Ala Ala 
255 260 265 



546 



588 



630 



672 



714 



798 



40 TTA AAC AGA GTC CCA GAT AAC AAC GAC CAC GAA AGG GAC ACA 
Leu Asn Arg Val Pro Asp Asn Asn Asp His Glu Arg Asp Thr 
270 275 280 



840 
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GTA CCA GTA TTT AAT CCA GTC CTA GAA TCA CCA GAA TCT CCA 
Val Pro Val Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro 
285 290 

GAC CCA ATA ACA TTT CCA TCT GCT TTA GAA AGA ATG AGA AAT 
5 Asp Pro He Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn 
295 300 305 



GGT GAA TTT CCT GAC GTT GAT GTC ATC ATT GGA TTC AAT AGT 966 
Gly Glu Phe Pro Asp Val Asp Val He He Gly Phe Asn Ser 
310 315 320 

10 GCT GAA GGT TTA AGA TCT ATG CCA AGA GTA ACC AGA GGA AAC 10 08 

Ala Glu Gly Leu Arg Ser Met Pro Arg Val Thr Arg Gly Asn 
325 330 335 

ATG GAA GTT TAC AAG ACT TTG ACA AAT ATA GAG AGA GCT ATA 1050 
Met Glu Val Tyr Lys Thr Leu Thr Asn He Glu Arg Ala He 
15 340 345 350 

CCT AGA GAT GCT AAT ATT TGG AAA AAT CCT AAT GGC ATT GAG 1092 
Pro Arg Asp Ala Asn He Trp Lys Asn Pro Asn Gly He Glu 
355 360 

GAG AAA AAA CTT ATA AAA ATG CTT ACA GAG TTT TAT GAC CAA 1134 
20 Glu Lys Lys Leu He Lys Met Leu Thr Glu Phe Tyr Asp Gin 
365 * 370 375 

GTT AAA GAA CAA AAC GAT GAC ATC GAA GCC TAT GTC CAA CTA 117 6 

Val Lys Glu Gin Asn Asp Asp He Glu Ala Tyr Val Gin Leu 
380 385 390 

25 AAA GGC GAT GCT GGT TAT CTC CAA GGA ATT TAC CGT ACC TTG 1218 
Lys Gly Asp Ala Gly Tyr Leu Gin Gly He Tyr Arg Thr Leu 
395 400 405 

AAA GCC ATA TTT TTC AAT GAA ATC AAA AGA AAT TCC AAC TTG 1260 
Lys Ala He Phe Phe Asn Glu He Lys Arg Asn Ser Asn Leu 
30 ~ 410 415 420 

TAT TTG TAT AGG TTA TCA GAT GAT ACG TAT AGT GTA TAT AAA 13 02 

Tyr Leu Tyr Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys 
425 430 

AGT TAT ATC TTG CCC '-AT CGA T JG GGT TCC TTG CCA GGA GTT 1344 
35 Ser Tyr He Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val 
435 440 445 

AGT CAT GGT GAT GAT TTA GGA TAT CTT TTT GCA AAC TCT TTG 13 86 

Ser His Gly Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu 
450 455 460 

40 GAT GTT CCT ATT TTG GGA ACA ACG CAC ATT TCT ATA CCG CAA 14 2 8 

Asp Val Pro He Leu Gly Thr Thr His He Ser He Pro Gin 
465 470 475 
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GAT GCT ATG CAG ACT CTG GAA AGG ATG GTC AGG ATC TGG ACC 14 7 0 

Asp Ala Met Gin Thr Leu Glu Arg Met Val Arg lie Trp Thr 
480 485 490 

AAT TTT GTa AAG AAT GGA AAA CCT AC A TCA AAC ACT GAA GAT 1512 
5 Asn Phe Val Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp 

495 500 

GCA TCA TGT GAT ACA AAA AG A CAT TTA AAC GAC aTT TTT TGG 1554 
Ala Ser Cys Asp Thr Lys Arg His Leu Asn Asp lie Phe Trp 
505 510 515 

10 GAA CCA TAC AAC GAC GAA GAA CCA AAA TAT TTG GAC ATG GGA 15 96 

Glu Pro Tyr Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly 
520 525 530 

AAA GAA CAT TTT GAA ATG AAA AAT ATT TTG GAA CTA AAA CGC 163 8 

Lys Glu His Phe Glu Met Lys Asn lie Leu Glu Leu Lys Arg 
15 535 540 545 

ATG ATG CTT TGG GAT GAA GTT TAT AGA AAT GCG AAT TTG CGG 168 0 

Met Met Leu Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg 
550 555 560 

TTT AGA GTC TGT AAT GAA GAA AGT ATT AGA 1710 
20 Phe Arg Val Cys Asn Glu Glu Ser He Arg 
565 570 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 nucleotides 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

30 (A) NAME /KEY : CDS 

(B) LOCATION: 1..1785 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATG TTA CCT CAC AGT GCA TTA GTT TTA TTT TTA TTT TTT TTA 4 2 

Met Leu Pro His Ser Ala Leu Val Leu Phe Leu Phe Phe Leu 
35 1 5 10 

TTT TTC TTA TTT ACA CCT GTC TTG TGC ATA CTA TGG GAT AAC 84 
Phe Phe Leu Phe Thr Pro Val Leu Cys He Leu Trp Asp Asn 
15 20 25 



40 



CTA 
Leu 



GAT CAG CAT TTG TGC AGA GTT CAA TTT AAC GGG ATC ACG 
Asp Gin His Leu Cys Arg Val Gin Phe Asn Gly He Thr 
30 35 40 



126 
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GAA GGA AAA CCG TTC CGA TAT AAA GAT CAT AAA AAT GAT GTA 168 
Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Lys Asn Asp Val 
45 50 55 

TAT TGT TCC TAT TTG GGA ATT CCT TAT GCA GAA CCG CCT ATT 210 
5 Tyr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu Pro Pro He 
60 65 70 

GGA CCA TTG CGA TTT CAG TCT CCA AAA CCA ATA TCA AAT CCA 252 
Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro He Ser Asn Pro 
75 80 

10 AAA ACA GGA TTC GTT CAG GCT CGG TCT TTA GGA GAC AAA TGT 2 94 

Lys Thr Gly Phe Val Gin Ala Arg Ser Leu Gly Asp Lys Cys 
85 90 95 

TTC CAG GAA AGT CTA ATA TAT TCT TAT GCA GGA AGC GAA GAT 3 36 

Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly Ser Glu Asp 
15 100 105 no 

TGC TTA TAT CTG AAT ATA TTC ACG CCA GAG ACT GTT AAT TCT 37 8 

Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr Val Asn Ser 
115 120 125 

GCG AAC AAT ACA AAA TAT CCT GTA ATG TTC TGG ATC CAT GGA 42 0 

20 Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp He His Gly 
130 135 140 

GGC GCA TTC AAC CAA GGA TCA GGA TCT TAT AAT TTT TTT GGA 4 62 

Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn Phe Phe Gly 
145 150 

25 CCT GAT TAT TTG ATC AGG GAA GGA ATT ATT TTG GTC ACT ATC 504 
Pro Asp Tyr Leu He Arg Glu Gly He He Leu Val Thr He 
155 160 165 

AAC TAT AGA TTA GGA GTT TTC GGT TTT CTA TCA GCG CCG GAA 54 6 

Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser Ala Pro Glu 
30 170 175 180 

TGG GAT ATC CAT GGA AAT ATG GGT CTA AAA GAC CAG AGA TTG 588 
Trp Asp He His Gly Asn Met Gly Leu Lys Asp Gin Arg Leu 
185 190 195 

GCA CTA AAA TGG GTT TAT GAC AAC ATC GAA AAA TTT GGT GGA 63 r 

35 Ala Leu Lys Trp Val tyr Asp Asn He Glu Lys Phe Gly Gly 
200 205 210 

GAC AGA GAT AAA ATC ACT ATA GCT GGA GAA TCT GCT GGA GCA 672 
Asp Arg Asp Lys He Thr He Ala Gly Glu Ser Ala Gly Ala 
215 220 

40 GCA AGT GTT CAT TTT CTG ATG ATG GAC AAT TCT ACT AGA AAA 714 
Ala Ser Val His Phe Leu Met Met Asp Asn Ser Thr Arg Lys 
225 230 235 



-157- 



FC-1-C1-PCT 



TAC TAC CAA AGG GCA ATT TTG CAG AGT GGG ACA TTA CTC AAT 756 
Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr Leu Leu Asn 
240 245 250 

CCG ACT GCT AAT CAA ATT CAA CCT CTG CAT AGA TTT GAA AAA 7 98 

5 Pro Thr Ala Asn Gin He Gin Pro Leu His Arg Phe Glu Lys 
255 260 265 

CTA AAA CAA GTG CTG AAC ATC ACG CAA AAA CAA GAA CTC CTA 84 0 

Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin Glu Leu Leu 
270 275 280 

10 AAT CTG GAC AAA AAT CAA ATT TTG CGA GCA GCC TTA AAC AGA 8 82 

Asn Leu Asp Lys Asn Gin He Leu Arg Ala Ala Leu Asn Arg 
285 290 

GTC CCA GAT AAC AAC GAC CAC GAA AGG GAC ACA GTA CCA GTA 92 4 

Val Pro Asp Asn Asn Asp His Glu Arg Asp Thr Val Pro Val 
15 295 300 305 

TTT AAT CCA GTC CTA GAA TCA CCA GAA TCT CCA GAC CCA ATA 96 6 

Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro Asp Pro He 
310 315 320 

ACA TTT CCA TCT GCT TTA GAA AGA ATG AGA AAT GGT GAA TTT 10 0 8 

20 Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn Gly Glu Phe 
325 330 335 

CCT GAC GTT GAT GTC ATC ATT GGA TTC AAT AGT GCT GAA GGT 1050 
Pro Asp Val Asp Val He He Gly Phe Asn Ser Ala Glu Gly 
340 345 350 

25 TTA AGA TCT ATG CCA AGA GTA ACC AGA GGA AAC ATG GAA GTT 10 92 

Leu Arg Ser Met Pro Arg Val Thr Arg Gly Asn Met Glu Val 
355 360 

TAC AAG ACT TTG ACA AAT ATA GAG AGA GCT ATA CCT AGA GAT 1134 
Tyr Lys Thr Leu Thr Asn He Glu Arg Ala He Pro Arg Asp 
30 365 370 375 

GCT AAT ATT TGG AAA AAT CCT AAT GGC ATT GAG GAG AAA AAA 117 6 

Ala Asn He Trp Lys Asn Pro Asn Gly He Glu Glu Lys Lys 
380 * 385 390 

CTT ATA AAA ATG CTT ACA GAG TTT TAT G,\C CAA GTT AAA GAA 1218 
35 Leu He Lys Met Leu Thr Glu Phe Tyr i.sp Gin \±: Lys Glu 
395 400 405 

CAA AAC GAT GAC ATC GAA GCC TAT GTC CAA CTA AAA GGC GAT 12 60 

Gin Asn Asp Asp He Glu Ala Tyr Val Gin Leu Lys Gly Asp 
410 415 420 

40 GCT GGT TAT CTC CAA GGA ATT TAC CGT ACC TTG AAA GCC ATA 13 02 

Ala Gly Tyr Leu Gin Gly He Tyr Arg Thr Leu Lys Ala He 
425 430 
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TTT TTC AAT GAA ATC AAA AGA AAT TCC AAC TTG TAT TTG TAT 1344 
Phe Phe Asn Glu He Lys Arg Asn Ser Asn Leu Tyr Leu Tyr 
435 440 445 

AGG TTA TCA GAT GAT ACG TAT AGT GTA TAT AAA AGT TAT ATC 13 86 

5 Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys Ser Tyr He 
450 455 460 

TTG CCC TAT CGA TGG GGT TCC TTG CCA GGA GTT AGT CAT GGT 1428 
Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val Ser His Gly 
465 470 475 

10 GAT GAT TTA GGA TAT CTT TTT GCA AAC TCT TTG GAT GTT CCT 147 0 

Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu Asp Val Pro 
480 485 490 

ATT TTG GGA ACA ACG CAC ATT TCT ATA CCG CAA GAT GCT ATG 1512 
He Leu Gly Thr Thr His He Ser He Pro Gin Asp Ala Met 
15 " 495 500 

CAG ACT CTG GAA AGG ATG GTC AGG ATC TGG ACC AAT TTT GTA 1554 
Gin Thr Leu Glu Arg Met Val Arg He Trp Thr Asn Phe Val 
505 510 515 

AAG AAT GGA AAA CCT ACA TCA AAC ACT GAA GAT GCA TCA TGT 1596 
20 Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp Ala Ser Cys 
520 " 525 530 

GAT ACA AAA AGA CAT TTA AAC GAC ATT TTT TGG GAA CCA TAC 16 3 8 

Asp Thr Lys Arg His Leu Asn Asp He Phe Trp Glu Pro Tyr 
535 540 545 

25 AAC GAC GAA GAA CCA AAA TAT TTG GAC ATG GGA AAA GAA CAT 16 80 

Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly Lys Glu His 
550 555 560 

TTT GAA ATG AAA AAT ATT TTG GAA CTA AAA CGC ATG ATG CTT 1722 
Phe Glu Met Lys Asn He Leu Glu Leu Lys Arg Met Met Leu 
30 565 570 

TGG GAT GAA GTT TAT AGA AAT GCG AAT TTG CGG TTT AGA GTC 17 64 

Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg Phe Arg Val 
575 ~ 580 585 



TGT AAT GAA GAA AGT ATT AGA 
35 Cys Asn Glu Jlu Ser H. Arg 
590 595 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 nucleotides 

4Q (b) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



1785 
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(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TCTAATACTT TCTTCATTAC AGACTCTAAA CCGCAAATTC GCATTTCTAT 
AAACTTCATC CCAAAGCATC ATGCGTTTTA GTTCCAAAAT ATTTTTCATT 
TCAAAATGTT CTTTTCCCAT GTCCAAATAT TTTGGTTCTT CGTCGTTGTA 
5 TGGTTCCCAA AAAATGTCGT TTAAATGTCT TTTTGTATcA C ATG ATG CAT 
CTTCAGTGTT TGATGTAGGT TTTCCATTCT TTACAAAATT GGTCCAGATC 
CTGACCATCC TTTCCAGAGT CTGCATAGCA TCTTGCGGTA TAGAAATGTG 
CGTTGTTCCC AAAATAGGAA CATCCAAAGA GTTTGCAAAA AGATATCCTA 
AATCATCACC ATGACTAACT CCTGGCAAGG AACCCCATCG ATAGGGCAAG 
10 ATATAACTTT TATATACACT ATACGTATCA TCTGATAACC TATACAAATA 
CAAGTTGGAA TTTCTTTTGA TTTCATTGAA AAATATGGCT TTCAAGGTAC 
GGTAAATTCC TTGGAGATAA CCAGCATCGC CTTTTAGTTG GACATAGGCT 
TCGATGTCAT CGTTTTGTTC TTTAACTTGG TCATAAAACT CTGTAAGCAT 
TTTTATAAGT TTTTTCTCCT CAATGCCATT AGGATTTTTC CAAATATTAG 
15 CATCTCTAGG TATAGCTCTC TCTATATTTG TCAAAGTCTT GTAAACTTCC 
ATGTTTCCTC TGGTTACTCT TGGCATAGAT CTTAAACCTT CAGCACTATT 
GAATCCAATG ATGACATCAA CGTCAGGAAA TTCACCATTT CTCATTCTTT 
CTAAAGCAGA TGGAAATGTT ATTGGGTCTG GAGATTCTGG TGATTCTAGG 
ACTGGATTAA ATACTGGTAC TGTGTCCCTT TCGTGGTCGT TGTTATCTGG 
20 GACTCTGTTT AAGGCTGCTC GCAAAATTTG ATTTTTGTCC AGATTTAGGA 
GTTCTTGTTT TTGCGTGATG TTCAGCACTT GTTTTAGTTT TTCAAATCTA 
TGCAGAGGTT GAATTTGATT AGCAGTCGGA TTGAGTAATG TCCCACTCTG 
CAAAATTGCC CTTTGGTAGT ATTTTCTAGT AGAATTGTCC ATCATCAGAA 
AATGAACACT TGCTGCTCCA GCAGATTCTC C AG CTATAGT GATTTTATCT 
25 CTGTCTCCAC CAAATTTTTC GATGTTGTCA TAAACCCATT TTAGTGCCAA 
TCTCTGGTCT TTTAGACCCA TATTTCCATG GATATCCCAT TCCGGCGCTG 
ATAGAAAACC GAAAACTCCT AATCTATAGT TGATAGTGAC CAAAATAATT 
CCTTCCCTGA TCAAATAATC AGGTCCAAAA AAATTATAAG ATCCTGATCC 
TTGGTTGAAT GCGCCTCCAT GGATCCAGAA CATTACAGGA TATTTTGTAT 
30 TGTTCGCAGA ATTAACAGTC TCTGGCGTGA ATATATTCAG ATATAAGCAA 
TCTTCGCTTC CTGCATAAGA ATATATTAGA CTTTCCTGGA AACATTTGTC 
TCCTAAAGAC CGAGCCTGAA CGAATCCTGT TTTTGGATTT GATATTGGTT 
TTGGAGACTG AAATCGCAAT GGTCCAATAG GCGGTTCTGC ATAAGGAATT 
CCCAAATAGG AACAATATAC ATCATTTTTA TGATCTTTAT ATCGGAACGG 
35 TTTTCCTTCC GTGATCCCGT TAAATTGAAC TCTGCACAAA TGCTGATCTA 
GGTTATCCCA TAGTATGCAC AAGACAGGTG TAAATAAGAA AAATAAAAAA 
AATAAAAATA AAACTAATGC ACTGTGAGGT AACAT 

(2) INFORMATION FOR SEQ ID NO:36: 
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(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2007 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: 



CDNA 



45 



(iii) 



FEATURE : 

(A) NAME/ KEY: 

(B) LOCATION: 



CDS 
11. .1594 



(iv) 



SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
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AGTTCCAACG ATG GCT GAT CTA CAA GTG ACT TTG CTT CAA GGT 4 3 

Met Ala Asp Leu Gin Val Thr Leu Leu Gin Gly 
1 5 10 

ACT TTA AAA GGA AAA GAG CAA ATT AGT GAA AAA GGA AAT GTG 85 
5 Thr Leu Lys Gly Lys Glu Gin He Ser Glu Lys Gly Asn Val 
15 20 25 

TTC CAT AGT TAT TCT GGA ATT CCA TAT GCC AAA CCT CCT GTA 12 7 

Phe His Ser Tyr Ser Gly He Pro Tyr Ala Lys Pro Pro Val 
30 35 

10 GGT GAT CTA AGA TTT AAG CCA CCT CAA CCT GCA GAA CCT TGG 16 9 

Gly Asp Leu Arg Phe Lys Pro Pro Gin Pro Ala Glu Pro Trp 
40 ~ 45 50 

TCA GGT GTT CTT GAT GCT AGT AAA GAA GGG AAT AGT TGT AGA 211 
Ser Gly Val Leu Asp Ala Ser Lys Glu Gly Asn Ser Cys Arg 
15 55 60 65 

TCA GTA CAT TTT ATT AAA AAA ATT AAA GTA GGG GCT GAA GAT 2 53 

Ser Val His Phe He Lys Lys He Lys Val Gly Ala Glu Asp 
70 75 80 

TGT TTA TAC CTC AAT GTC TAT GTA CCA AAA ACA TCA GAG AAA 2 95 

20 Cys Leu Tyr Leu Asn Val Tyr Val Pro Lys Thr Ser Glu Lys 
85 90 95 

TCA CTT CTT CCA GTA ATG GTA TGG ATA CAT GGA GGA GGC TTC 337 
Ser Leu Leu Pro Val Met Val Trp He His Gly Gly Gly Phe 
100 1° 5 

25 TTC ATG GGA TCT GGA AAT AGT GAT ATG TAT GGT CCT GAA TAT 37 9 

Phe Met Gly Ser Gly Asn Ser Asp Met Tyr Gly Pro Glu Tyr 
110 H5 120 

TTG ATG GAT TAT GGA ATT GTT CTG GTT ACT TTC AAT TAT CGA 421 
Leu Met Asp Tyr Gly He Val Leu Val Thr Phe Asn Tyr Arg 
30 125 ' 130 135 

TTA GGT GTT TTG GGA TTT TTG AAC CTG GGA ATA GAA GAA GCG 4 63 

Leu Gly Val Leu Gly Phe Leu Asn Leu Gly He Glu Glu Ala 
140 145 150 

CCT GGC AAT GTT GGT TTG ATG GAC CAG GTT GAA GCT CTA AAA 505 
35 Pro Gly Asn Val Gly Leu Met Asp Gin Val Glu Ala Leu Ly^ 
155 160 165 

TGG GTA AAA AAC AAT ATT GCA TCC TTT GGT GGT GAC CCC AAC 547 
Trp Val Lys Asn Asn He Ala Ser Phe Gly Gly Asp Pro Asn 
170 175 

40 AAT GTG ACT ATT TTT GGA GAA TCA GCA GGT GGT GCA AGT GTT 5 89 

Asn Val Thr He Phe Gly Glu Ser Ala Gly Gly Ala Ser Val 
180 185 190 
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CAT TAT TTG ATG TTA TCA GAT CTT TCC AAA GGA CTT TTT CAT 631 
His Tyr Leu Met Leu Ser Asp Leu Ser Lys Gly Leu Phe His 
195 200 205 

AAA GCG ATC TCA CAA AGT GGA AGT GCT TTT AAT CCT TGG GCA 67 3 

5 Lys Ala He Ser Gin Ser Gly Ser Ala Phe Asn Pro Trp Ala 
210 215 220 

CTT CAA CAT GAT AAT AAT AAA GAA AAT GCA TTC CGC CTC TGC 715 
Leu Gin His Asp Asn Asn Lys Glu Asn Ala Phe Arg Leu Cys 
225 230 235 

10 AAA CTT CTG GGT CAT CCT GTC GAT AAC GAG ACA GAA GCT CTA 7 57 

Lys Leu Leu Gly His Pro Val Asp Asn Glu Thr Glu Ala Leu 
240 245 

AAA ATC CTT CGT CAA GCC CCC ATA GAT GAT CTT ATA GAC AAC 7 99 

Lys He Leu Arg Gin Ala Pro He Asp Asp Leu He Asp Asn 
15 250 255 260 

AG A ATA AAA CCA AAA GAC AAA GGC CAA CTT ATT ATA GAC TAT 841 
Arg He Lys Pro Lys Asp Lys Gly Gin Leu He He Asp Tyr 
265 270 275 

CCT TTT CTA CCA ACA ATA GAA AAA CGT TAT CAA AAT TTT GAA 8 83 

20 Pro Phe Leu Pro Thr He Glu Lys Arg Tyr Gin Asn Phe Glu 
280 285 290 

CCA TTC TTG GAC CAG TCT CCA TTA TCA AAA ATG CAA TCA GGC 92 5 

Pro Phe Leu Asp Gin Ser Pro Leu Ser Lys Met Gin Ser Gly 
295 300 305 

25 AAT TTC ACA AAA GTC CCA TTT ATA TGT GGA TAC AAC AGT GCT 967 
Asn Phe Thr Lys Val Pro Phe He Cys Gly Tyr Asn Ser Ala 
310 315 

GAA GGA ATT TTA GGT TTA ATG GAC TTC AAG GAT GAC CCA AAT 100 9 

Glu Gly He Leu Gly Leu Met Asp Phe Lys Asp Asp Pro Asn 
30 320 * 325 330 

ATA TTT GAG AAG TTT GAA GCT GAT TTT GAA AGA TTT GTA CCA 1051 
He Phe Glu Lys Phe Glu Ala Asp Phe Glu Arg Phe Val Pro 
335 340 345 

GTA GAT TTG AAT CTA ACT TTA AGO TCT AAG GAA TCT AAA AAA 10 93 

35 Val Asp Leu Asn Leu Thr Leu ^: g Ser L, 3 31u Ser Lys Lys 
350 355 360 

TTG GCT GAA GAA ATG AGA AAG TTT TAT TAC CAA GAC GAA CCT 113 5 

Leu Ala Glu Glu Met Arg Lys Phe Tyr Tyr Gin Asp Glu Pro 
365 m 370 375 

40 GTT TCT TCA GAC AAC AAA GAA AAA TTT GTC AGT GTT ATT AGT 117 7 

Val Ser Ser Asp Asn Lys Glu Lys Phe Val Ser Val He Ser 
380 385 
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GAT ACT TGG TTT TTG AGA GGG ATT AAA AAT ACT GCA AGA TAT 1219 
Asp Thr Trp Phe Leu Arg Gly He Lys Asn Thr Ala Arg Tyr 
390 395 400 

ATA ATT GAA CAT TCC TCA GAA CCG TTA TAT TTA TAT GTT TAT 1261 
He He Glu His Ser Ser Glu Pro Leu Tyr Leu Tyr Val Tyr 



405 



410 415 



AGT TTT GAT GAT TTT GGT TTT TTG AAG AAA CTT GTA TTA GAT 13 03 

Ser Phe Asp Asp Phe Gly Phe Leu Lys Lys Leu Val Leu Asp 
420 425 430 

10 CCT AAT ATT GAA GGA GCA GCT CAT GGA GAT GAG CTG GGA TAT 13 4 5 

Pro Asn He Glu Gly Ala Ala His Gly Asp Glu Leu Gly Tyr 



435 



440 445 



CTT TTC AAG ATG AGT TTT ACA GAA TTT CCA AAA GAT TTA CCA 13 87 

Leu Phe Lys Met Ser Phe Thr Glu Phe Pro Lys Asp Leu Pro 
15 " 450 455 

AGT GCA GTG GTG AAT AGG GAA CGA TTG TTG CAA CTT TGG ACA 14 29 

Ser Ala Val Val Asn Arg Glu Arg Leu Leu Gin Leu Trp Thr 
460 465 470 

AAT TTT GCA AAA ACA GGA AAT CCC ACT CCT GAA ATC AAT GAT 14 71 

20 Asn Phe Ala Lys Thr Gly Asn Pro Thr Pro Glu He Asn Asp 
475 480 485 

GTT ATA ACA ACA AAA TGG GAT AAA GCT ACT GAG GAA AAA TCA 1513 
Val He Thr Thr Lys Trp Asp Lys Ala Thr Glu Glu Lys Ser 
490 495 500 

25 GAT CAT ATG GAT ATC GAT AAT ACT TTG AGA ATG ATT CCA GAT 15 5 5 

Asp His Met Asp He Asp Asn Thr Leu Arg Met He Pro Asp 



505 



510 515 



CCT GAT GCA AAA CGA CTT AGA TTT TGG AAT AAA TTT TTA TGA 15 97 

Pro Asp Ala Lys Arg Leu Arg Phe Trp Asn Lys Phe Leu 
30 520 525 

TAA ATATACCAAT TATCGATTTT ATTATAGAGT TTCTGTATTA 1640 

GTATAATTAT CACGTTTAGA TGTACGAGAT TCAATTGGCT CTAATTGAAG 1690 

TATATTTCGA TTTCAAATTT ACTCTGATTA TTGGAAAAAA AGCTTTTACA 174 0 

GTTGTAATAA TCAAGAAGTA GGTGGTAAAT TTAGAACAAA TTCTGTTTTA 17 90 

35 GTG^TT'GCG CATTCAACAG ATGGTGTACT GTGCCTAAAT TTGTCGCTCT 184 0 

TCTTGiiAGAA CT^A -CTAAA AATGTGATTA ATGGACGCCA CATTATTTAT 1890 

ATTTGATATT ATTACCATCT TTGTATCATA TTTGCTTTTA TTTTTTCATT 194 0 

TTTTTTTTAT TTCAAATATA TTGTTTTTTT ATAAAAAAAA AAAAAAAAAA 1990 
AAAAAAAAAA AAAAAAA 

40 (2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Ala Asp Leu Gin Val Thr Leu Leu Gin Gly Thr Leu Lys 
1 5 10 

5 Gly Lys Glu Gin lie Ser Glu Lys Gly Asn Val Phe His Ser 
15 20 25 

Tyr Ser Gly lie Pro Tyr Ala Lys Pro Pro Val Gly Asp Leu 
30 35 40 

Arg Phe Lys Pro Pro Gin Pro Ala Glu Pro Trp Ser Gly Val 
10 45 50 55 

Leu Asp Ala Ser Lys Glu Gly Asn Ser Cys Arg Ser Val His 
60 65 70 

Phe He Lys Lys He Lys Val Gly Ala Glu Asp Cys Leu Tyr 
75 80 

15 Leu Asn Val Tyr Val Pro Lys Thr Ser Glu Lys Ser Leu Leu 
85 90 95 

Pro Val Met Val Trp He His Gly Gly Gly Phe Phe Met Gly 
100 105 HO 

Ser Gly Asn Ser Asp Met Tyr Gly Pro Glu Tyr Leu Met Asp 
20 H5 120 125 

Tyr Gly He Val Leu Val Thr Phe Asn Tyr Arg Leu Gly Val 
130 135 140 

Leu Gly Phe Leu Asn Leu Gly He Glu Glu Ala Pro Gly Asn 
145 150 

25 Val Gly Leu Met Asp Gin Val Glu Ala Leu Lys Trp Val Lys 
155 160 165 

Asn Asn He Ala Ser Phe Gly Gly Asp Pro Asn Asn Val Thr 
170 175 180 

He Phe Gly Glu Ser Ala Gly Gly Ala Ser Val His Tyr Leu 
30 185 190 195 

Met Leu Ser Asp Leu Ser Lys Gly Leu Phe His Lys Ala He 
200 205 210 

Ser Gin Ser Gly Ser Ala Phe Asn Pro Trp Ala Leu Gin His 
215 220 

35 Asp Asn Asn Lys Glu Asn Ala Phe Arg Leu Cys Lys Leu Leu 
225 230 235 

Gly His Pro Val Asp Asn Glu Thr Glu Ala Leu Lys He Leu 
240 245 250 
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Arg Gin Ala Pro He Asp Asp Leu He Asp Asn Arg He Lys 
255 260 265 

Pro Lys Asp Lys Gly Gin Leu He He Asp Tyr Pro Phe Leu 
270 275 280 

5 Pro Thr He Glu Lys Arg Tyr Gin Asn Phe Glu Pro Phe Leu 

285 290 

Asp Gin Ser Pro Leu Ser Lys Met Gin Ser Gly Asn Phe Thr 
295 3°° 305 

Lys Val Pro Phe He Cys Gly Tyr Asn Ser Ala Glu Gly He 
10 310 315 320 

Leu Gly Leu Met Asp Phe Lys Asp Asp Pro Asn He Phe Glu 
325 330 335 

Lys Phe Glu Ala Asp Phe Glu Arg Phe Val Pro Val Asp Leu 
340 345 350 

15 Asn Leu Thr Leu Arg Ser Lys Glu Ser Lys Lys Leu Ala Glu 

355 360 

Glu Met Arg Lys Phe Tyr Tyr Gin Asp Glu Pro Val Ser Ser 
365 370 375 

Asp Asn Lys Glu Lys Phe Val Ser Val He Ser Asp Thr Trp 
20 380 385 390 

Phe Leu Arg Gly He Lys Asn Thr Ala Arg Tyr He He Glu 
395 400 405 

His Ser Ser Glu Pro Leu Tyr Leu Tyr Val Tyr Ser Phe Asp 
410 -415 420 

25 Asp Phe Gly Phe Leu Lys Lys Leu Val Leu Asp Pro Asn He 

425 430 

Glu Gly Ala Ala His Gly Asp Glu Leu Gly Tyr Leu Phe Lys 
435 440 445 

Met Ser Phe Thr Glu Phe Pro Lys Asp Leu Pro Ser Ala Val 
30 450 455 460 

Val Asn Arg Glu Arg Leu Leu Gin Leu Trp Thr Asn Phe Ala 
465 470 475 

Lys Thr Gly Asn Pro Thr Pro Glu He Asn Asp Val He Thr 
480 485 490 

35 Thr Lys Trp Asp Lys Ala Thr Glu Glu Lys Ser Asp His Met 

495 500 

Asp He Asp Asn Thr Leu Arg Met He Pro Asp Pro Asp Ala 
505 510 515 
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Lys Arg Leu Arg Phe Trp Asn Lys Phe Leu 
520 525 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2007 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 (Hi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTATAAA AAAACAATAT 
ATTTGAAATA AAAAAAAAAT GAAAAAATAA AAGCAAATAT GATACAAAGA 
TGGTAATAAT AT C AAAT AT A AATAATGTGG CGTCCATTAA TCACATTTTT 
AGTTCAGTTC TTCAAGAAGA GCGACAAATT TAGGCACAGT ACACCATCTG 
15 TTGAATGCGC AAATCACTAA AACAGAATTT GTTCTAAATT TACCACCTAC 
TTCTTGATTA TTACAACTGT AAAAGCTTTT TTTCCAATAA TCAGAGTAAA 
TTTGAAATCG AAATATACTT CAATTAGAGC CAATTGAATC TCGTACATCT 
AAACGTGATA ATTATACTAA TACAGAAACT CTATAATAAA ATCGATAATT 
GGTATATTTA TCATAAAAAT TTATTC C AAA ATCTAAGTCG TTTTGCATCA 
20 GGATCTGGAA TCATTCTCAA AGTATTATCG ATATCCATAT GATCTGATTT 
TTCCTCAGTA GCTTTATCCC ATTTTGTTGT TATAACATCA TTGATTTCAG 
GAGTGGGATT TCCTGTTTTT GCAAAATTTG TCCAAAGTTG CAACAATCGT 
TCCCTATTCA CCACTGCACT TGGTAAATCT TTTGGAAATT CTGTAAAACT 
CATCTTGAAA AGATATCCCA GCTCATCTCC ATGAGCTGCT CCTTCAATAT 
25 TAGGATCTAA TACAAGTTTC TTCAAAAAAC CAAAATCATC AAAACTATAA 
ACATATAAAT ATAACGGTTC TGAGGAATGT TCAATTATAT ATCTTGCAGT 
ATTTTTAATC CCTCTCAAAA ACCAAGTATC ACTAATAACA CTGACAAATT 
TTTCTTTGTT GTCTGAAGAA ACAGGTTCGT CTTGGTAATA AAACTTTCTC 
ATTTCTTCAG CCAATTTTTT AGATTCCTTA GACCTTAAAG TTAGATTCAA 
30 ATCTACTGGT ACAAATCTTT CAAAATCAGC TTCAAACTTC TCAAATATAT 
TTGGGTCATC CTTGAAGTCC ATTAAACCTA AAATTCCTTC AGCACTGTTG 
TATCCACATA TAAATGGGAC TTTTGTGAAA TTGCCTGATT GCATTTTTGA 
TAATGGAGAC TGGTCCAAGA ATGGTTCAAA ATTTTGATAA CGTTTTTCTA 
TTGTTGGTAG AAAAGGATAG TCTATAATAA GTTGGCCTTT GTCTTTTGGT 
35 TTTATTCTGT TGTCTATAAG ATCATCTATG GGGGCTTGAC GAAGGATTTT 
TAGAGCTTCT GTCTCGTTAT CGACAGGATG ACC CAGAAGT TTGCAGAGGC 
GGAATGCATT TTCTTTATTA TTATCATGTT GAAGTGCCCA AGGATTAAAA 
GCACTTCCAC TTTGTGAGAT CGCTTTATGA AAAAGTCCTT TGGAAAGATC 
TGATAACATC AAATAATGAA CACTTGCACC ACCTGCTGAT TCTCCAAAAA 
40 TAGTCACATT GTTGGGGTCA CCACCAAAGG ATGCAATATT GTTTTTTACC 

caSSagag cttcaacctu .tccatca; ^> .CAACATTGC CAGGCGCTTC 1550 
TTCTATTCCC AGGTTCAAAA ATCCCAAAAC ACCTAATCGA TAATTGAAAG 
TAACCAGAAC AATTCCATAA TC CATC AAAT ATTCAGGACC AT AC AT AT C A 
CTATTTCCAG ATCCCATGAA GAAGCCTCCT CCATGTATCC ATACCATTAC 
45 TGGAAGAAGT GATTTCTCTG ATGTTTTTGG TACATAGACA TTGAGGTATA 
AACAATCTTC AGCCCCTACT TTAATTTTTT TAATAAAATG TACTGATCTA 
CAACTATTCC CTTCTTTACT AGCATCAAGA ACACCTGACC AAGGTTCTGC 
AGGTTGAGGT GGCTTAAATC TTAGATCACC TACAGGAGGT TTGGCATATG 
GAATTCCAGA ATAACTATGG AACACATTTC CTTTTTCACT AATTTGCTCT 
50 TTTCCTTTTA AAGTACCTTG AAGCAAAGTC ACTTGTAGAT CAGCCATCGT 
TGGAACT 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
5 (C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Asp Pro Pro Thr Val Thr Leu Pro Gin Gly Glu Leu 
1 5 10 

10 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 

(iii) FEATURE: 

(A) NAME / KEY : Xaa = any ammo acid 

(B) LOCATION: 21 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

20 Asp Pro Pro Thr Val Thr Leu Pro Gin Gly Glu Leu Val Gly 

_ i n 



1 



5 10 



Lys Ala Thr Asn Glu Asn Xaa Lys 
15 20 

(2) INFORMATION FOR SEQ ID NO: 41: 

25 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: linear 

(ii> MOLECULE TYPE: peptide 

30 (iii) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Asp Pro Pro Thr Val Thr Leu Pro Gin Gly Glu Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

35 (a) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
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(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Asp Pro Pro Thr Val Thr Leu Pro Gin Gly Glu Leu Val Gly 
5 1 5 10 

Lys Ala Leu Ser Asn Glu Asn 
15 20 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

15 Asp Pro Pro Thr Val Thr Leu Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 3 amino acids 

20 (B) TYPE: amino acid 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Asp Pro Pro Thr Val Thr Leu Pro Gin Gly Glu Leu Val Gly 
25 1 5 10 

Lys Ala Leu Thr Asn Glu Asn Gly Lys 
15 20 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2 0 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
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AATTAACCCT CACTAAAGGG 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(iii) FEATURE: 

(A) NAME/ KEY : R - A or G 

(B) LOCATION: 2, 12, 14 

(iv) FEATURE: 

(A) NAME/ KEY : D = A, G or T 

(B) LOCATION: 3, 6, 9, 15 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 46 
ARDCCDCCDC CRTRDAT 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 bases 

) (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 47 
5 TGTGCTCGAG ATGGGATAAC CTAGATCAGC ATTTGTGC 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 bases 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:4E 
TTAAGGTACC TCATC TAATA CTTCCTTCAT TACAG 



35 (2) INFORMATION FOR SEQ ID NO: 49: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
AAAACTGCAG TATAAATATG TTACCTCACA GTAGTG 36 

(2) INFORMATION FOR SEQ ID NO:50: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

!5 (ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:50: 
TGCTCTAGAT TATCTAATAC TTCCTTCATT ACAG 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1540 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 (iii) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..154 0 

(iv) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CTT CAA GGT ACT TTA AAA GG\ AAA GAG CAA ATT AGT GAA AAA 
30 Leu Gin Gly In, Leu Lys 3"y Lys Glu Gin He Ser Glu Lys 



i 



5 10 



GGA AAT GTG TTC CAT AGT TAT TCT GGA ATT CCA TAT GCC AAA 
Gly" val Phe His Ser Tyr Ser Gly He Pro Tyr Ala Lys 

15 20 25 

35 CCT CCT GTA GGT GAT CTA AGA TTT AAG CCA CCT -CAA CCT GCA 
Pro Pro Sal Gly Asp Leu Arg Phe Lys Pro Pro Gin Pro Ala 
30 



34 



42 



84 



126 



35 40 
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GAA CCT TGG TCA GGT GTT CTT GAT GCT AGT AAA GAA GGG AAT 16 8 
Glu Pro Trp Ser Gly Val Leu Asp Ala Ser Lys Glu Gly Asn 
45 50 55 

AGT TGT AGA TCA GTA CAT TTT ATT AAA AAA ATT AAA GTA GGG 210 
5 Ser Cys Arg Ser Val His Phe lie Lys Lys He Lys Val Gly 
60 65 70 

GCT GAA GAT TGT TTA TAC CTC AAT GTC TAT GTA CCA AAA ACA 2 52 
Ala Glu Asp Cys Leu Tyr Leu Asn Val Tyr Val Pro Lys Thr 
75 80 

10 TCA GAG AAA TCA CTT CTT CCA GTA ATG GTA TGG ATA CAT GGA 2 94 
Ser Glu Lys Ser Leu Leu Pro Val Met Val Trp He His Gly 
85 90 95 

GGA GGC TTC TTC ATG GGA TCT GGA AAT AGT GAT ATG TAT GGT 33 6 
Gly Gly Phe Phe Met Gly Ser Gly Asn Ser Asp Met Tyr Gly 
15 100 105 HO 

CCT GAA TAT TTG ATG GAT TAT GGA ATT GTT CTG GTT ACT TTC 378 
Pro Glu Tyr Leu Met Asp Tyr Gly He Val Leu Val Thr Phe 
115 120 125 

AAT TAT CGA TTA GGT GTT TTG GGA TTT TTG AAC CTG GGA ATA 42 0 
20 Asn Tyr Arg Leu Gly Val Leu Gly Phe Leu Asn Leu Gly He 
130 135 140 

GAA GAA GCG CCT GGC AAT GTT GGT TTG ATG GAC CAG GTT GAA 4 62 
Glu Glu Ala Pro Gly Asn Val Gly Leu Met Asp Gin Val Glu 
145 150 

25 GCT CTA AAA TGG GTA AAA AAC AAT ATT GCA TCC TTT GGT GGT 5 04 
Ala Leu Lys Trp Val Lys Asn Asn He Ala Ser Phe Gly Gly 
155 160 165 

GAC CCC AAC AAT GTG ACT ATT TTT GGA GAA TCA GCA GGT GGT 546 
Asp Pro Asn Asn Val Thr He Phe Gly Glu Ser Ala Gly Gly 
30 170 175 180 

GCA AGT GTT CAT TAT TTG ATG TTA TCA GAT CTT TCC AAA GGA 58 8 
Ala Ser Val His Tyr Leu Met Leu Ser Asp Leu Ser Lys Gly 
185 190 195 

crT<T TTT CAT AAA QCG ATC TCA CAA AGT GGA AGT GCT TTT AAT 63 0 
35 I 3u Phe His Lys Ala He Ser Gin Ser Gly Ser Ala Phe Asn 
200 205 210 

CCT TGG GCA CTT CAA CAT GAT AAT AAT AAA GAA AAT GCA TTC 672 
Pro Trp Ala Leu Gin His Asp Asn Asn Lys Glu Asn Ala Phe 
215 220 

40 CGC CTC TGC AAA CTT CTG GGT CAT CCT GTC GAT AAC GAG ACA 714 
Arg Leu Cys Lys Leu Leu Gly His Pro Val Asp Asn Glu Thr 
225 230 235 
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GAA GCT CTA AAA ATC CTT CGT CAA GCC CCC ATA GAT GAT CTT 756 
Glu Ala Leu Lys lie Leu Arg Gin Ala Pro He Asp Asp Leu 
240 " 24S 250 

ATA GAC AAC AGA ATA AAA CCA AAA GAC AAA GGC CAA CTT ATT 7 98 
5 He Asp Asn Arg He Lys Pro Lys Asp Lys Gly Gin Leu He 
255 260 265 

ATA GAC TAT CCT TTT CTA CCA ACA ATA GAA AAA CGT TAT CAA 840 
He Asp Tyr Pro Phe Leu Pro Thr He Glu Lys Arg Tyr Gin 
270 275 280 

10 AAT TTT GAA CCA TTC TTG GAC CAG TCT CCA TTA TCA AAA ATG 882 
Asn Phe Glu Pro Phe Leu Asp Gin Ser Pro Leu Ser Lys Met 
285 290 

CAA TCA GGC AAT TTC ACA AAA GTC CCA TTT ATA TGT GGA TAC 924 
Gin Ser Gly Asn Phe Thr Lys Val Pro Phe He Cys Gly Tyr 
15 295 300 305 

AAC AGT GCT GAA GGA ATT TTA GGT TTA ATG GAC TTC AAG GAT 966 
Asn Ser Ala Glu Gly He Leu Gly Leu Met Asp Phe Lys Asp 
310 3X5 320 

GAC CCA AAT ATA TTT GAG AAG TTT GAA GCT GAT TTT GAA AGA 1008 
20 Asp Pro Asn He Phe Glu Lys Phe Glu Ala Asp Phe Glu Arg 
325 330 335 

TTT GTA CCA GTA GAT TTG AAT CTA ACT TTA AGG TCT AAG GAA 1050 
Phe Val Pro Val Asp Leu Asn Leu Thr Leu Arg Ser Lys Glu 
340 345 350 

25 TCT AAA AAA TTG GCT GAA GAA ATG AGA AAG TTT TAT TAC CAA 1092 
Ser Lys Lys Leu Ala Glu Glu Met Arg Lys Phe Tyr Tyr Gin 
355 360 

GAC GAA CCT GTT TCT TCA GAC AAC AAA GAA AAA TTT GTC AGT 1134 
Asp Glu Pro Val Ser Ser Asp Asn Lys Glu Lys Phe Val Ser 
30 365 370 375 

GTT ATT AGT GAT ACT TGG TTT TTG AGA GGG ATT AAA AAT ACT 117 6 
val He Ser Asp Thr Trp Phe Leu Arg Gly He Lys Asn Thr 
380 385 390 

GCA AGA TAT ATA ATT GAA CAT TCC TCA GAA CCG TTA TAT TTA .218 
35 Ala Arg Tyr He He Glu His Ser Ser Glu Pro Leu Tyr Leu 
395 400 405 

TAT GTT TAT AGT TTT GAT GAT TTT GGT TTT TTG AAG AAA CTT 1260 
Tvr Val Tyr Ser Phe Asp Asp Phe Gly Phe Leu Lys Lys Leu 
410 415 420 

40 GTA TTA GAT CCT AAT ATT GAA GGA GCA GCT CAT GGA GAT GAG 13 0 2 
Val Leu Asp Pro Asn He Glu Gly Ala Ala His Gly Asp Glu 
425 430 
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CTG GGA TAT CTT TTC AAG ATG AGT TTT ACA GAA TTT CCA AAA 1344 
Leu Gly Tyr Leu Phe Lys Met Ser Phe Thr Glu Phe Pro Lys 
435 440 445 

GAT TTA CCA AGT GCA GTG GTG AAT AGG GAA CGA TTG TTG CAA 13 86 
5 Asp Leu Pro Ser Ala Val Val Asn Arg Glu Arg Leu Leu Gin 
450 455 460 

CTT TGG ACA AAT TTT GCA AAA ACA GGA AAT CCC ACT CCT GAA 142 8 
Leu Trp Thr Asn Phe Ala Lys Thr Gly Asn Pro Thr Pro Glu 
465 470 475 

10 ATC .AAT GAT GTT ATA ACA ACA AAA TGG GAT AAA GCT ACT GAG 147 0 
lie Asn Asp Val He Thr Thr Lys Trp Asp Lys Ala Thr Glu 
480 485 490 

GAA AAA TCA GAT CAT ATG GAT ATC GAT AAT ACT TTG AG A ATG 1512 
Glu Lys Ser Asp His Met Asp He Asp Asn Thr Leu Arg Met 
15 J 495 500 

ATT CCA GAT CCT GAT GCA AAA CGA CTT A 154 0 

He Pro Asp Pro Asp Ala Lys Arg Leu 
505 510 



(2) INFORMATION FOR SEQ ID NO:52: 

20 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 84 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: CDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

TAAAAATTTA TTCCAAAATC TAAGTCGTTT TGCATCAGGA TCTGGAATCA 50 
TTCTCAAAGT ATTATCGATA TCCATATGAT CTGATTTTTC CTCAGTAGCT 10 0 



-173- FC-1-C1-PCT 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

5 (iii) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

TAAAAATTTA TTCCAAAATC TAAGTCGTTT TGCATCAGGA TCTGGAATCA 50 

TTCTCAAAGT ATTATCGATA TCCATATGAT CTGATTTTTC CTCAGTAGCT 100 

TTATCCCATT TTGTTGTTAT AACATCATTG ATTTCAGGAG TGGGATTTCC 15 0 

TGTTTTTGCA AAATTTGTCC AAAGTTGCAA CAATCGTTCC CTATTCACCA 2 00 

10 CTGCACTTGG TAAATCTTTT GGAAATTCTG TAAAACTCAT CTTGAAAAGA 2 50 

TATCCCAGCT CATCTCCATG AGCTGCTCCT TCAATATTAG GATCTAATAC 3 00 

AAGTTTCTTC AAAAAACCAA AATCATCAAA ACTATAAACA TATAAATATA 3 50 

ACGGTTCTGA GGAATGTTCA ATTATATATC TTGCAGTATT TTTAATCCCT 4 00 

CTCAAAAACC AAGTATCACT AATAACACTG ACAAATTTTT CTTTGTTGTC 4 50 

15 TGAAGAAACA GGTTCGTCTT GGTAATAAAA CTTTCTCATT TCTTCAGCCA 500 

ATTTTTTAGA TTCCTTAGAC CTTAAAGTTA GATTCAAATC TACTGGTACA 550 

AATCTTTCAA AATCAGCTTC AAACTTCTCA AATATATTTG GGTCATCCTT 600 

GAAGTCCATT AAACCTAAAA TTCCTTCAGC ACTGTTGTAT C C AC AT AT AA 650 

ATGGGACTTT TGTGAAATTG CCTGATTGCA TTTTTGATAA TGGAGACTGG 700 

20 TCCAAGAATG GTTCAAAATT TTGATAACGT TTTTCTATTG TTGGTAGAAA 7 50 

AGGATAGTCT ATAATAAGTT GGCCTTTGTC TTTTGGTTTT ATTCTGTTGT 8 00 

CTATAAGATC ATCTATGGGG GCTTGACGAA GGATTTTTAG AGCTTCTGTC 850 

TCGTTATCGA CAGGATGACC CAGAAGTTTG CAGAGGCGGA ATGCATTTTC 900 

TTTATTATTA TCATGTTGAA GTGCCCAAGG ATTAAAAGCA CTTCCACTTT 950 

25 GTGAGATCGC TTTATGAAAA AGTCCTTTGG AAAGATCTGA TAACATCAAA 100 0 

TAATGAACAC TTGCACCACC TGCTGATTCT C C AAAAAT AG TCACATTGTT 105 0 

GGGGTCACCA CCAAAGGATG CAATATTGTT TTTTAC CC AT TTTAGAGCTT 110 0 

CAACCTGGTC CATCAAACCA AC ATTGC CAG GCGCTTCTTC TATTCCCAGG 115 0 

TTCAAAAATC CCAAAACACC TAATCGATAA TTGAAAGTAA CCAGAACAAT 12 0 0 

30 TCCATAATCC ATCAAATATT CAGGACCATA CATATCACTA TTTCCAGATC 12 5 0 

CCATGAAGAA GCCTCCTCCA TGTATC CAT A CCATTACTGG AAGAAGTGAT 13 0 0 

TTCTCTGATG TTTTTGGTAC ATAGACATTG AGGTATAAAC AATCTTCAGC 13 50 

CCCTACTTTA ATTTTTTTAA TAAAATGTAC TGATCTACAA CTATTCCCTT 14 0 0 

CTTTACTAGC ATCAAGAACA CCTGACCAAG GTTCTGCAGG TTGAGGTGGC 14 5 0 

35 TTAAATCTTA GATCACCTAC AGGAGGTTTG GCATATGGAA TTCCAGAATA 15 0 0 

ACTATGGAAC ACATTTCCTT TTTCACTAAT TTGCTCTTTT CCTTTTAAAG 15 50 

TACCTTGAAG CAAAGTCACT TGTAGATCAG CCAT 15 84 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

40 (A) LENGTH : 53 0 amino acids 

(B) r PE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

45 Asp Pro Pro Thr Val Thr Leu Pro Gin Gly Glu Leu Val Gly 
15 10 



Lys Ala Leu Thr Asn Glu Asn Gly Lys Glu Tyr Phe Ser Tyr 
15 20 25 
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Thr Gly Val Pro Tyr Ala Lys Pro Pro Val Gly Glu Leu Arg 
30 35 40 

Phe Lys Pro Pro Gin Lys Ala Glu Pro Trp Asn Gly Val Phe 
45 50 S5 

5 Asn Ala Thr Ser His Gly Asn Val Cys Lys Ala Leu Asn Phe 
60 65 70 

Phe Leu Lys Lys lie Glu Gly Asp Glu Asp Cys Leu Leu Val 
75 80 ' 

Asn Val Tyr Ala Pro Lys Thr Thr Ser Asp Lys Lys Leu Pro 
10 85 ' 90 95 

Val Phe Phe Trp Val His Gly Gly Gly Phe Val Thr Gly Ser 
100 105 11° 

Gly Asn Leu Glu Phe Gin Ser Pro Asp Tyr Leu Val Asn Tyr 
115 120 125 

15 Asp Val He Phe Val Thr Phe Asn Tyr Arg Leu Gly Pro Leu 
130 135 140 

Gly Phe Leu Asn Leu Glu Leu Glu Gly Ala Pro Gly Asn Val 
145 150 

Gly Leu Leu Asp Gin Val Ala Ala Leu Lys Trp Thr Lys Glu 
20 155 160 165 

Asn He Glu Lys Phe Gly Gly Asp Pro Glu Asn He Thr He 
170 175 180 

Gly Gly Val Ser Ala Gly Gly Ala Ser Val His Tyr Leu Leu 
185 190 195 

25 Leu Ser His Thr Thr Thr Gly Leu Tyr Lys Arg Ala He Ala 

200 205 210 

Gin Ser Gly Ser Ala Leu Asn Pro Trp Ala Phe Gin Arg His 
215 220 

Pro Val Lys Arg Ser Leu Gin Leu Ala Glu He Leu Gly His 
30 225 230 235 

Pro Thr Asn Asn Thr Gin Asp Ala Leu Glu Phe Leu Gin Lys 
240 245 250 

Ala Pro Val Asp Ser Leu Leu Lys Lys Met Pro Ala Glu Thr 
255 260 265 

35 Glu Gly Glu He He Glu Glu Phe Val Phe Val Pro Ser He 
270 275 280 
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Glu Lys Val Phe Pro Ser His Gin Pro Phe Leu Glu Glu Ser 
285 290 

Pro Leu Ala Arg Met Lys Ser Gly Ser Phe Asn Lys Val Pro 
295 30° 305 

5 Leu Leu Val Gly Phe Asn Ser Ala Glu Gly Leu Leu Tyr Lys 
310 315 320 

Phe Phe Met Lys Glu Lys Pro Glu Met Leu Asn Gin Ala Glu 
325 330 335 

Ala Asp Phe Glu Arg Leu Val Pro Ala Glu Phe Glu Leu Ala 
10 340 345 350 

His Gly Ser Glu Glu Ser Lys Lys Leu Ala Glu Lys He Arg 
355 360 

Lys Phe Tyr Phe Asp Asp Lys Pro Val Pro Glu Asn Glu Gin 
365 370 375 

15 Lys Phe He Asp Leu He Gly Asp lie Trp Phe Thr Arg Gly 
380 385 390 

He Asp Lys His Val Lys Leu Ser Val Glu Lys Gin Asp Glu 
395 400 405 

Pro Val Tyr Tyr Tyr Glu Tyr Ser Phe Ser Glu Ser His Pro 
20 410 415 420 

Ala Lys Gly Thr Phe Gly Asp His Asn Leu Thr Gly Ala Cys 
425 430 

His Gly Glu Glu Leu Val Asn Leu Phe Lys Val Glu Met Met 
435 440 445 

25 Lys Leu Glu Lys Asp Lys Pro Asn Val Leu Leu Thr Lys Asp 

450 455 460 

Arg Val Leu Ala Met Trp Thr Asn Phe He Lys Asn Gly Asn 
465 470 475 

Pro Thr Pro Glu Val Thr Glu Leu Leu Pro Val Lys Trp Glu 
30 480 485 490 

Pro Ala Thr Lys Asp Lys Leu Asn Tyr Leu Asn He Asp Ala 
495 500 

Thr Leu Thr Leu Gly Thr Asn Pro Glu Glu Thr Arg Val Lys 
505 510 515 

35 Phe Trp Glu Asp Ala Thr Lys Thr Leu His Ser Gin 
520 525 530 
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(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 0 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Trp Asp Asn Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn 
1 5 10 

10 Gly lie Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Arg 



15 



20 



25 



Asn Asp Val Tyr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu 



30 



35 



40 



Pro Pro Phe Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro He 
45 50 55 

Ser Asn Pro Lys Thr Gly Phe Val Gin Ala Arg Thr Leu Gly 



60 



65 70 



Asp Lys Cys Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly 
75 80 

20 Ser Glu Asp Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr 



85 



90 95 



Val Asn Ser Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp 
100 105 HO 



He His Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn 
25 H5 



120 125 



Phe Phe Gly Pro Asp Tyr Leu He Arg Glu Gly He He Leu 
130 135 I 40 

Val Thr He Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser 
145 150 

30 Ala Pro Glu Trp Asp He His Gly Asn Mst Gly Leu Lys Asp 
155 160 165 

Gin Arg Leu Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys 
170 175 180 

Phe Gly Gly Asp Arg Glu Lys He Thr He Ala Gly Glu Ser 
35 185 190 195 

Ala Gly Ala Ala Ser Val His Phe Leu Met Met Asp Asn Ser 
200 205 2X0 
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Thr Arg Lys Tyr Tyr Gin Arg Ala lie Leu Gin Ser Gly Thr 
215 220 

Leu Leu Asn Pro Thr Ala Asn Gin He Gin Leu Leu His Arg 
225 230 235 

5 Phe Glu Lys Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin 
240 245 250 

Glu Leu Leu Asn Leu Asp Lys Asn Leu He Leu Arg Ala Ala 
255 260 265 

Leu Asn Arg Val Pro Asp Ser Asn Asp His Asp Arg Asp Thr 
10 270 275 280 

Val Pro Val Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro 
285 290 

Asp Pro He Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn 
295 300 305 

15 Gly Glu Phe Pro Asp Val Asp Val He He Gly Phe Asn Ser 
310 315 320 

Ala Glu Gly Leu Arg Ser Met Ala Arg Val Thr Arg Gly Asn 
325 330 335 

Met Glu Val His Lys Thr Leu Thr Asn He Glu Arg Ala He 
20 340 345 350 

Pro Arg Asp Ala Asn He Trp Lys Asn Pro Asn Gly He Glu 
355 360 

Glu Lys Lys Leu He Lys Met Leu Thr Glu Phe Tyr Asp Gin 
365 370 375 

25 Val Lys Glu Gin Asn Asp Asp He Glu Ala Tyr Val Gin Leu 
380 385 390 

Lys Gly Asp Ala Gly Tyr Leu Gin Gly He Tyr Arg Thr Leu 
395 400 405 

Lys Ala He Phe Phe Asn Glu Phe Arg Arg Asn Ser Asn Leu 
30 ' 4U 415 420 

Tyr Leu Tyr Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys 
425 430 

Ser Tyr He Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val 
435 440 450 

35 Ser His Gly Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu 
450 455 460 
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Asp Val Pro lie Leu Gly Thr Thr His He Ser lie Pro Gin 
465 470 475 

Asp Ala Met Gin Thr Leu Glu Arg Met Val Arg He Trp Thr 
480 485 490 

5 Asn Phe Val Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp 



495 



500 



Ala Ser Cys Asp Thr Lys Arg His Leu Asn Asp He Phe Trp 
505 " 510 515 



Glu Pro Tyr Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly 
10 520 



525 530 



Lys Glu Asn Phe Glu Met Lys Asn He Leu Glu Leu Lys Arg 
535 540 545 

Met Met Leu Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg 
550 555 560 

15 Phe Arg Val Cys Asn Glu Gly Ser He Arg 

565 570 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 amino acids 

20 (b) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

Trp Asp Asn Leu Asp Gin His Leu Cys Arg Val Gin Phe Asn 
25 i 5 io 

Gly He Thr Glu Gly Lys Pro Phe Arg Tyr Lys Asp His Lys 
15 20 25 

Asn Asp Val Tyr Cys Ser Tyr Leu Gly He Pro Tyr Ala Glu 
30 35 40 

30 Pro Pro He Gly Pro Leu Arg Phe Gin Ser Pro Lys Pro He 
45 50 55 

Ser Asn Pro Lys Thr Gly Phe Val Gin Ala Arg Ser Leu Gly 
60 65 70 

Asp Lys Cys Phe Gin Glu Ser Leu He Tyr Ser Tyr Ala Gly 
35 75 80 

Ser Glu Asp Cys Leu Tyr Leu Asn He Phe Thr Pro Glu Thr 
85 ~ 90 95 



s 
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val Asn Ser Ala Asn Asn Thr Lys Tyr Pro Val Met Phe Trp 



100 



105 



110 



lie His Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn 
115 120 125 

5 Phe Phe Gly Pro Asp Tyr Leu He Arg Glu Gly He lie Leu 
130 . 135 140 

Val Thr He Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser 
145 150 " 

Ala Pro Glu Trp Asp He His Gly Asn Met Gly Leu Lys Asp 
10 155 160 165 

Gin Arg Leu Ala Leu Lys Trp Val Tyr Asp Asn lie Glu Lys 
170 175 I 80 

Phe Gly Gly Asp Arg Asp Lys He Thr He Ala Gly Glu Ser 
185 19° 195 

15 Ala Gly Ala Ala Ser Val His Phe Leu Met Met Asp Asn Ser 
200 205 210 

Thr Arg Lys Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr 



215 220 



Leu Leu Asn Pro Thr Ala Asn Gin He Gin Pro Leu His Arg 
20 225 230 235 

Phe Glu Lys Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin 
240 



245 250 



Glu Leu Leu Asn Leu Asp Lys Asn Gin He Leu Arg Ala Ala 

260 265 



255 



25 Leu Asn Arg Val Pro Asp Asn Asn Asp His Glu Arg Asp Thr 
270 



275 280 



val Pro Val Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro 
285 290 

Asp Pro lie Thr Phe Pro Ser Ala Leu Glu Arg Met Arg Asn 
30 295 300 305 



Gly 



Glu Phe Pro Asp Val Asp Val He He Gly Phe Asn Ser 



310 



315 320 



Ala Glu Gly Leu Arg Ser Met Pro Arg Val Thr Arg Gly Asn 



325 



330 335 



35 Met Glu val Tyr Lys Thr Leu Thr Asn He Glu Arg Ala He 
340 345 350 
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Pro Arg Asp Ala Asn lie Trp Lys Asn Pro Asn Gly He Glu 
355 360 

Glu Lys Lys Leu He Lys Met Leu Thr Glu Phe Tyr Asp Gin 
365 " 370 375 

5 val Lys Glu Gin Asn Asp Asp He Glu Ala Tyr Val Gin Leu 
380 385 390 

Lys Gly Asp Ala Gly Tyr Leu Gin Gly He Tyr Arg Thr Leu 
395 400 405 



Lvs Ala He Phe Phe Asn Glu He Lys Arg Asn Ser Asn Leu 
410 415 420 

Tyr Leu Tyr Arg Leu Ser Asp Asp Thr Tyr Ser Val Tyr Lys 



425 



430 



Ser Tyr He Leu Pro Tyr Arg Trp Gly Ser Leu Pro Gly Val 
435 440 445 

15 Ser His Gly Asp Asp Leu Gly Tyr Leu Phe Ala Asn Ser Leu 
450 455 460 

Asp Val Pro He Leu Gly Thr Thr His He Ser He Pro Gin 
465 470 475 

Asp Ala Met Gin Thr Leu Glu Arg Met Val Arg He Trp Thr 
20 480 485 490 

Asn Phe Val Lys Asn Gly Lys Pro Thr Ser Asn Thr Glu Asp 
495 500 

Ala Ser Cys Asp Thr Lys Arg His Leu Asn Asp He Phe Trp 
505 510 515 

! 

25 Glu Pro Tyr Asn Asp Glu Glu Pro Lys Tyr Leu Asp Met Gly 



520 



525 530 



Lys Glu His Phe Glu Met Lys Asn He Leu Glu Leu Lys Arg 
535 540 545 

Met Met Leu Trp Asp Glu Val Tyr Arg Asn Ala Asn Leu Arg 
30 550 555 560 

Phe Arg Val Cys Asn Glu Gly ot.£ He Arg 
565 570 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

35 (A) LENGTH: 20 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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20 



aca nrr gta cct tat gct aaa cct cct gtt gga gaa ctt aga 

Thi fly Val Pro Tyr Ala Lys Pro Pro Val Gly Glu Leu Arg 
50 * 55 60 



TTT AAG CCT CCA CAG AAA GCT GAG CCA TGG CAA GGT GTT TTC 
35 Phe Lys Pro Pro Gin Lys Ala Glu Pro Trp Gin Gly Val Phe 
65 70 75 



20 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
GTGCGTACAC GTTTACTACC 

5 (2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2144 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 30.. 1682 

15 (iv) FEATURE: 

(A) NAME/ KEY : Asx = Asn or Asp 

(B) LOCATION: 4 62 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GTACACATAG TCAATAGTCT AGATCCAAG ATG TCT CGT GTT ATT TTT 

Met Ser Arg Val He Phe 
1 5 

TTA AGT TGT ATT TTT TTG TTT AGT TTT AAT TTT ATA AAA TGT 
Leu Ser Cys He Phe Leu Phe Ser Phe Asn Phe He Lys Cys 
10 15 20 

25 GAT TCC CCG ACT GTA ACT TTG CCC CAA GGC GAA TTG GTT GGA 131 
Asp Ser Pro Thr Val Thr Leu Pro Gin Gly Glu Leu Val Gly 
25 30 

AAA GCT TTG ACG AAC GAA AAT GGA AAA GAG TAT TTT AGC TAC 17 3 

Lys Ala Leu Thr Asn Glu Asn Gly Lys Glu Tyr Phe Ser Tyr 
30 35 40 45 



47 



89 



215 



257 
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AAC GCC ACA TTA TAC GGA AAT GTG TGT AAA TCT TTA AAT TTC 299 
Asn Ala Thr Leu Tyr Gly Asn Val Cys Lys Ser Leu Asn Phe 
80 35 90 

TTC TTG AAG AAA ATT GAA GGA GAC GAA GAC TGC TTG GTA GTA 341 
5 Phe Leu Lys Lys He Glu Gly Asp Glu Asp Cys Leu Val Val 

95 100 

AAC GTG TAC GCA CCA AAA ACA ACT TCT GAT AAA AAA CTT CCA 3 83 

Asn Val Tyr Ala Pro Lys Thr Thr Ser Asp Lys Lys Leu Pro 
105 HO H5 

10 GTA TTT TTC TGG GTT CAT GGT GGT GGT TTT GTG ACT GGA TCC 425 
Val Phe Phe Trp Val His Gly Gly Gly Phe Val Thr Gly Ser 
120 125 130 

GGA AAT TTA GAA TTC CAA AGC CCA GAT TAT TTA GTA RAT TTT 4 67 

Gly Asn Leu Glu Phe Gin Ser Pro Asp Tyr Leu Val Asx Phe 
15 " 135 140 145 

GAT GTT ATT TTC GTA ACT TTC AAT TAC CGA TTG GGA CCT CTC 509 
Asd val He Phe Val Thr Phe Asn Tyr Arg Leu Gly Pro Leu 
150 155 160 

GGA TTT CTG AAT TTG GAG TTG GAG GGT GCT CCA GGA AAT GTA 551 
20 Gly Phe Leu Asn Leu Glu Leu Glu Gly Ala Pro Gly Asn Val 

165 170 

GGA TTA TTG GAT CAG GTG GCA GCT CTG AAA TGG ACC AAA GAA 5 93 

Gly Leu Leu Asp Gin Val Ala Ala Leu Lys Trp Thr Lys Glu 
17 5 180 185 

25 AAC ATT GAG AAA TTT GGT GGA GAT CCA GAA AAT ATT ACA ATT 635 
Asn He Glu Lys Phe Gly Gly Asp Pro Glu Asn He Thr He 
190 195 200 

GGT GGT GTT TCT GCT GGT GGA GCA AGT GTT CAT TAT CTT TTG 67 7 

Gly Gly Val Ser Ala Gly Gly Ala Ser Val His Tyr Leu Leu 
30 205 210 215 

TTA TCT CAT ACA ACC ACT GGA CTT TAC AAA AGG GCA ATT GCT 719 
Leu Ser His Thr Thr Thr Gly Leu Tyr Lys Arg Ala He Ala 
220 225 230 

CAA AGT GGA AGT GCT TTT AAT CCA TGG GCC TTC CAA AGA CAT 761 
35 Gin Ser Gly Ser Ala Phe Asn Pro Trp Ala Phe Gin Arg His 

235 240 

CCA GTA AAG CGT AGT CTT CAA CTT GCT GAG ATA TTG GGT CAT 803 
Pro Val Lys Arg Ser Leu Gin Leu Ala Glu He Leu Gly His 
245 250 255 



-183- 



FC-1-C1-PCT 



CCC ACA AAC AAT ACT CAA GAT GCT TTA GAA TTC TTA CAA AAA 845 
Pro Thr Asn Asn Thr Gin Asp Ala Leu Glu Phe Leu Gin Lys 
260 265 270 

GCC CCC GTA GAC AGT CTC CTG AAG AAA ATG CCA GCT GAA ACA 887 
5 Ala Pro Val Asp Ser Leu Leu Lys Lys Met Pro Ala Glu Thr 
275 280 285 

GAA GGT GAA ATA ATA GAA GAG TTT GTC TTC GTA CCA TCA ATT 92 9 

Glu Gly Glu He He Glu Glu Phe Val Phe Val Pro Ser He 
290 295 300 

10 GAA AAA GTT TTC CCA TCC CAC CAA CCT TTC TTG GAA GAA TCA 971 
Glu Lys Val Phe Pro Ser His Gin Pro Phe Leu Glu Glu Ser 
305 310 

CCA TTG GCC AGA ATG AAA TCC GGA TCC TTT AAC AAA GTA CCT 1013 
Pro Leu Ala Arg Met Lys Ser Gly Ser Phe Asn Lys Val Pro 
15 315 320 325 

TTA TTA GTT GGA TTT AAC AGT GCA GAA GGA CTT TTG TTC AAA 10 5 5 

Leu Leu Val Gly Phe Asn Ser Ala Glu Gly Leu Leu Phe Lys 
330 335 340 

TTC TTC ATG AAA GAA AAA CCA GAG ATG CTG AAC CAA GCT GAA 10 97 

20 Phe Phe Met Lys Glu Lys Pro Glu Met Leu Asn Gin Ala Glu 
345 350 355 

GCA GAT TTT GAA AGA CTC GTA CCA GCC GAA TTT GAA TTA GTC 113 9 

Ala Asp Phe Glu Arg Leu Val Pro Ala Glu Phe Glu Leu Val 
360 365 370 

25 CAT GGA TCA GAG GAA TCG AAA AAA CTT GCA GAA AAA ATC AGG 1181 
His Gly Ser Glu Glu Ser Lys Lys Leu Ala Glu Lys He Arg 
375 380 

AAG TTT TAC TTT GAC GAT AAA CCC GTT CCA GAA AAT GAA CAG 1223 
Lys Phe Tyr Phe Asp Asp Lys Pro Val Pro Glu Asn Glu Gin 
30 385 390 395 

AAA TTT ATT GAC TTG ATA GGA GAT ATT TGG TTT ACT AGA GGT 12 6 5 

Lys Phe He Asp Leu He Gly Asp He Trp Phe Thr Arg Gly 
400 405 410 

GTT GAC AAG CAT GTC AAG TTG TCT GTG GAG AAA CP\ <iAC GAA 13 07 

35 Val Asp Lys His Val Lys Leu Ser Val Glu Lys Gxr. Asp Glu 
415 420 425 

CCA GTT TAT TAT TAT GAA TAT TCC TTC TCG GAA AGT CAT CCT 13 4 9 

Pro Val Tyr Tyr Tyr Glu Tyr Ser Phe Ser Glu Ser His Pro 
430 435 440 
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GCA AAA GGA ACA TTT GGT GAT CAT AAT CTG ACT GGT GCA TGC 13 91 

Ala Lys Gly Thr Phe Gly Asp His Asn Leu Thr Gly Ala Cys 
445 450 

CAT GGA GAA GAA CTT GTG AAT TTA TTC AAA GTC GAG ATG ATG 14 3 3 

5 His Gly Glu Glu Leu Val Asn Leu Phe Lys Val Glu Met Met 
455 460 465 

AAG CTG GAA AAA GAT AAA CCT AAT GTT CTA TTA ACA AAA GAT 1475 

Lys Leu Glu Lys Asp Lys Pro Asn Val Leu Leu Thr Lys Asp 
470 475 480 

10 AGA GTA CTT GCC ATG TGG ACT AAC TTC ATC -AAA AAT GGA AAT 1517 

Arg Val- Leu Ala Met Trp Thr Asn Phe lie Lys Asn Gly Asn 
485 490 495 

CCT ACT CCT GAA GTA ACA GAA TTA TTG CCA GTT AAA TGG GAA 155 9 

Pro Thr Pro Glu Val Thr Glu Leu Leu Pro Val Lys Trp Glu 
15 500 505 510 

CCT GCC ACA AAA GAC AAG TTG AAT TAT TTG AAC ATT GAT GCC 1601 
Pro Ala Thr Lys Asp Lys Leu Asn Tyr Leu Asn lie Asp Ala 
515 520 

ACC TTA ACT TTG GGA ACA AAT CCT GAG GCA AAC CGA GTC AAA 164 3 

20 Thr Leu Thr Leu Gly Thr Asn Pro Glu Ala Asn Arg Val Lys 
525 530 535 

TTT TGG GAA GAC GCC ACA AAA TCT TTG CAC GGT CAA TAA 16 82 

Phe Trp Glu Asp Ala Thr Lys Ser Leu His Gly Gin 
540 545 550 

25 TAATTTATGA AAATTGTTTT AAATACTTTA GGTAATATAT TAGGTAAATA 17 3 2 

AAAATTAAAA AATAACAATT TTTATGTTTT ATGTATTGGC TTATGTGTAT 17 8 2 

CAGTTCTAAT TTTATTTATT TATTCTTGTT TTGCTTGTTT TGAAATATCA 1832 

TGGTTTTAAT TTTCAAAACA CAACGTCGTT TGTTTTTAGC AAAATTTC C A 18 82 

ATAGATATGT TATATTAAGT ACTCTGAAGT ATTTTTATAT ATACACTAAA 1932 

30 ATCAGTAAAA ATACATTAAC TAAAAATATA AGATATTTTC AATAATTTTT 1982 

TTTAAAGAAA AT AC C AAAAA TAAAGTAAAA TTCCAAACGG AATTTTTGTT 2 03 2 

TAACTTAAAA ATAAAATTAA CTCTTCAATA ATTTTGATAA TTAGTATTTC 2 0 82 

TGATATCATT AGTGAAAATT ATATTTTGAT AATACGTATT TATATTTAAA 213 2 

ATAAAATTAT GT 2144 

35 (2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE Of ^tACTERISTICS : 

(A) LENuf-i : 550 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE : protein 



(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
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Met Ser Arg Val lie Phe Leu Ser Cys lie Phe Leu Phe Ser 
1 ~ 5 10 

Phe Asn Phe He Lys Cys Asp Ser Pro Thr Val Thr Leu Pro 
15 20 25 

5 Gin Gly Glu Leu Val Gly Lys Ala Leu Thr Asn Glu Asn Gly 
30 35 40 

Lys Glu Tyr Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro 
45 50 55 

Pro Val Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu 
10 60 65 70 

Pro Trp Gin Gly Val Phe Asn Ala Thr Leu Tyr Gly Asn Val 
75 80 

Cys Lys Ser Leu Asn Phe Phe Leu Lys Lys He Glu Gly Asp 
85 90 95 

15 Glu Asp Cys Leu Val Val Asn Val Tyr Ala Pro Lys Thr Thr 
100 105 HO 

Ser Asp Lys Lys Leu Pro Val Phe Phe Trp Val His Gly Gly 
115 120 125 

Gly Phe Val Thr Gly Ser Gly Asn Leu Glu Phe Gin Ser Pro 
20 130 135 140 

Asp Tyr Leu Val Asx Phe Asp Val He Phe Val Thr Phe Asn 
145 150 

Tyr Arg Leu Gly Pro Leu Gly Phe Leu Asn Leu Glu Leu Glu 
155 160 165 

25 Gly Ala Pro Gly Asn Val Gly Leu Leu Asp Gin Val Ala Ala 
170 175 180 

Leu Lys Trp Thr Lys Glu Asn He Glu Lys Phe Gly Gly Asp 
185 190 195 

Pro Glu Asn He Thr He Gly Gly Val Ser Ala Gly Gly Ala 
30 200 205 210 

Ser Val His Tyr Leu Leu Leu Ser His Thr Thr Thr Gly Leu 
215 220 

Tyr Lys Arg Ala He Ala Gin Ser Gly Ser Ala Phe Asn Pro 
225 230 ' 235 



35 Trp Ala Phe Gin Arg His Pro Val Lys Arg Ser Leu Gin Leu 
240 245 250 
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Ala Glu lie Leu Gly His Pro Thr Asn Asn Thr Gin Asp Ala 
255 260 265 

Leu Glu Phe Leu Gin Lys Ala Pro Val Asp Ser Leu Leu Lys 
270 275 280 

5 Lys Met Pro Ala Glu Thr Glu Gly Glu He He Glu Glu Phe 

285 290 

Val Phe Val Pro Ser He Glu Lys Val Phe Pro Ser His Gin 
295 300 305 

Pro Phe Leu Glu Glu Ser Pro Leu Ala Arg Met Lys Ser Gly 
10 310 315 320 

Ser Phe Asn Lys Val Pro Leu Leu Val Gly Phe Asn Ser Ala 
325 330 335 

Glu Gly Leu Leu Phe Lys Phe Phe Met Lys Glu Lys Pro Glu 
340 345 350 

15 Met Leu Asn Gin Ala Glu Ala Asp Phe Glu Arg Leu Val Pro 

355 360 

Ala Glu Phe Glu Leu Val His Gly Ser Glu Glu Ser Lys Lys 
365 370 375 

Leu Ala Glu Lys He Arg Lys Phe Tyr Phe Asp Asp Lys Pro 
20 380 * 385 390 

Val Pro Glu Asn Glu Gin Lys Phe He Asp Leu He Gly Asp 
395 400 405 

He Trp Phe Thr Arg Gly Val Asp Lys His Val Lys Leu Ser 
410 415 420 

25 Val Glu Lys Gin Asp Glu Pro Val Tyr Tyr Tyr Glu Tyr Ser 

425 430 

Phe Ser Glu Ser His Pro Ala Lys Gly Thr Phe Gly Asp His 
435 440 445 

Asn Leu Thr Gly Ala Cys His Gly Glu Glu Leu Val Asn Leu 
30 450 455 460 

Phe Lys Val Glu Met Met Lys Leu Glu Lys Asp Lys Pro Asn 
465 470 475 

Val Leu Leu Thr Lys Asp Arg Val Leu Ala Met Trp Thr Asn 
480 485 490 

35 Phe He Lys Asn Gly Asn Pro Thr Pro Glu Val Thr Glu Leu 

495 500 
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Leu Pro Val Lys Trp Glu Pro Ala Thr Lys Asp Lys Leu Asn 
505 510 515 

Tyr Leu Asn lie Asp Ala Thr Leu Thr Leu Gly Thr Asn Pro 
520 525 530 

5 Glu Ala Asn Arg Val Lys Phe Trp Glu Asp Ala Thr Lys Ser 
535 540 545 

Leu His Gly Gin 
550 

(2) INFORMATION FOR SEQ ID NO: 59: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2144 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

ACATAATTTT ATTTTAAATA TAAATACGTA TTATCAAAAT ATAATTTTCA 5 0 

CTAATGATAT CAGAAATACT AATTATCAAA ATTATTGAAG AGTTAATTTT 100 

ATTTTTAAGT TAAACAAAAA TTCCGTTTGG AATTTTACTT TATTTTTGGT 150 

20 ATTTTCTTTA AAAAAAATTA TTGAAAATAT CTTATATTTT TAGTTAATGT 2 00 

ATTTTTACTG ATTTTAGTGT ATATATAAAA ATACTTCAGA GTACTTAATA 2 50 

TAACATATCT ATTGGAAATT TTGCTAAAAA CAAACGACGT TGTGTTTTGA 3 00 

AAATTAAAAC CATGATATTT CAAAACAAGC AAAACAAGAA TAAATAAATA 350 

AAATTAGAAC TGATACACAT AAGC CAATAC ATAAAACATA AAAATTGTTA 4 00 

25 TTTTTTAATT TTTATTTACC TAATATATTA CCTAAAGTAT TTAAAACAAT 450 

TTTCATAAAT TATTATTGAC CGTGCAAAGA TTTTGTGG CG TCTTCCCAAA 500 

ATTTGACTCG GTTTGCCTCA GGATTTGTTC CCAAAGTTAA GGTGGCATCA 55 0 

ATGTTCAAAT AATTCAACTT GTCTTTTGTG GCAGGTTCCC ATTTAACTGG 600 

CAATAATTCT GTTACTTCAG GAGTAGGATT TCCATTTTTG ATGAAGTTAG 65 0 

30 TCCACATGGC AAGTACTCTA TCTTTTGTTA ATAGAACATT AGGTTTATCT 700 

TTTTCCAGCT TCATCATCTC GACTTTGAAT AAATTCACAA GTTCTTCTCC 750 

ATGGCATGCA CCAGTCAGAT TATGATCACC AAATGTTCCT TTTGCAGGAT 8 00 

GACTTTCCGA GAAGGAATAT TCATAATAAT AAACTGGTTC GTCTTGTTTC 85 0 

TCCACAGACA ACTTGACATG CTTGTCAACA CCTCTAGTAA ACCAAATATC 900 

35 TCCTATCAAG TCAATAAATT TCTGTTCATT TTCTGGAACG GGTTTATCGT 950 

CAAAGTAAAA CTTCCTGATT TTTTCTGCAA GTTTTTTCGA TTCCTCTGAT 1000 

CCATGGACTA ATTCAAATTC GGCTGGTACG AGTCT^TCAA AATCTGCTTC 105 0 

AGCTTGGTTC AGCATCTCTG GTTTTTCTTT CATGA^CAAT TTGAACAAAA 110 0 

GTCCTTCTGC ACTGTTAAAT CCAACTAATA AAGGTACTTT GTTAAAGGAT 1150 

40 CCGGATTTCA TTCTGGCCAA TGGTGATTCT TCCAAGAAAG GTTGGTGGGA 12 0 0 

TGGGAAAACT TTTTCAATTG ATGGTACGAA GACAAACTCT TCTATTATTT 12 5 0 

CACCTTCTGT TTCAGCTGGC ATTTTCTTCA GGAGACTGTC TACGGGGGCT 13 0 0 

TTTTGTAAGA ATTCTAAAGC ATCTTGAGTA TTGTTTGTGG GATGACCCAA 13 5 0 

TATCTCAGCA AGTTGAAGAC TACG CTTTAC TGGATGTCTT TGGAAGGCCC 14 0 0 

45 ATGGATTAAA AGCACTTCCA CTTTGAGCAA TTGCCCTTTT GTAAAGTC C A 14 5 0 

GTGGTTGTAT GAGATAACAA AAGATAATGA ACACTTGCTC CACCAGCAGA 15 0 0 
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AACACCACCA ATTGTAATAT TTTCTGGATC TCCACCAAAT TTCTCAATGT 1550 

TTTCTTTGGT CCATTTCAGA GCTGCCACCT GATCCAATAA TCCTACATTT 160 0 

CCTGGAGCAC CCTCCAACTC CAAATTCAGA AATCCGAGAG GTCCCAATCG 165 0 

GTAATTGAAA GTTACGAAAA TAACATCAAA ATYTACTAAA TAATCTGGGC 17 0 0 

5 TTTGGAATTC TAAATTTCCG GATCCAGTCA CAAAACCACC ACCATGAACC 1750 

CAGAAAAATA CTGGAAGTTT TTTATCAGAA GTTGTTTTTG GTGCGTACAC 1800 

GTTTACTACC AAGCAGTCTT CGTCTCCTTC AATTTTCTTC AAGAAGAAAT 1850 

TTAAAGATTT ACACACATTT CCGTATAATG TGGCGTTGAA AACACCTTGC 1900 

CATGGCTCAG CTTTCTGTGG AGGCTTAAAT CTAAGTTCTC CAACAGGAGG 1950 

10 TTTAGCATAA GGTACACCTG TGTAGCTAAA ATACTCTTTT C C ATTTTCGT 2000 

TCGTCAAAGC TTTTCCAACC AATTCGCCTT GGGGCAAAGT TACAGTCGGG 2 050 

GAATCACATT TTATAAAATT AAAACTAAAC AAAAAAATAC AACTTAAAAA 2100 

AATAACACGA GACATCTTGG ATCTAGACTA TTGACTATGT GTAC 2144 

(2) INFORMATION FOR SEQ ID NO: 60: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

20 (ii) MOLECULE TYPE: CDNA 

(iii) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1650 

(iv) FEATURE: 

25 (A) NAME/KEY: Asx = Asn or Asp 

(B) LOCATION: 43 3 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATG TCT CGT GTT ATT TTT TTA AGT TGT ATT TTT TTG TTT AGT 4 2 

Met Ser Arg Val lie Phe Leu Ser Cys lie Phe Leu Phe Ser 
30 1 5 10 

TTT AAT TTT ATA AAA TGT GAT TCC CCG ACT GTA ACT TTG CCC 84 
Phe Asn Phe lie Lys Cys Asp Ser Pro Thr Val Thr Leu Pro 
15 20 25 

CAA GGC GAA TTG GTT GGA AAA GCT TTG ACG AAC GAA AAT GGA 12 6 

35 Gin Gly Glu Leu Val Gly Lys Ala Leu Thr Asn Glu Asn Gly 
30 35 40 

AAA GAG TAT TTT AGC TAC ACA GGT GTA CCT TAT GCT AAA CCT 16 8 

Lys Glu Tyr Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro 
45 50 55 

40 CCT GTT GGA GAA CTT AGA TTT AAG CCT CCA CAG AAA GCT GAG 210 
Pro Val Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu 
60 65 70 



-189- 



CCA TGG 
Pro Trp 



TGT AAA 
5 Cys Lys 
85 

GAA GAC 
Glu Asp 
100 

10 TCT GAT 
Ser Asp 



GGT TTT 
Gly Phe 

15 

GAT TAT 
Asp Tyr 



TAC CGA 
20 Tyr Arg 
155 

GGT GCT 
Gly Ala 
170 

25 CTG AAA 
Leu Lys 



CCA GAA 
Pro Glu 

30 

AGT GTT 
Ser Val 



TAC AAA 
35 Tyr Lys 
225 

TGG GCC 
Trp Ala 
240 

40 GCT GAG 
Ala Glu 



CAA GGT GTT 
Gin Gly Val 
75 

TCT TTA AAT 
Ser Leu Asn 



TGC TTG GTA 
Cys Leu Val 

AAA AAA CTT 
Lys Lys Leu 
115 

GTG ACT GGA 
Val Thr Gly 
130 

TTA GTA RAT 
Leu Val Asx 
145 

TTG GGA CCT 
Leu Gly Pro 



CCA GGA AAT 
Pro Gly Asn 



TGG ACC AAA 
Trp Thr Lys 
185 

AAT ATT ACA 

Asn He Thr 
200 

CAT TAT CTT 
His Tyr Leu 
215 

AGG GCA ATT 
Arg Ala He 



TTC CAA AGA 
Phe Gin Arg 



ATA TTG GGT 
He Leu Gly 
255 



TTC AAC GCC 
Phe Asn Ala 



TTC TTC TTG 
Phe Phe Leu 
90 

GTA AAC GTG 
Val Asn Val 
105 

CCA GTA TTT 
Pro Val Phe 
120 

TCC GGA AAT 
Ser Gly Asn 



TTT GAT GTT 
Phe Asp Val 



CTC GGA TTT 
Leu Gly Phe 
160 

GTA GGA TTA 
Val Gly Leu 
175 

GAA AAC ATT 
Glu Asn He 
190 

ATT GGT GGT 
He Gly Gly 

TTG TTA TCT 
Leu Leu Ser 



GCT CAA AGT 
Ala Gin Ser 
230 

CAT CCA GTA 
His Pro Val 
245 

CAT CCC ACA 
His Pro Thr 
260 



ACA TTA TAC 
Thr Leu Tyr 
80 

AAG AAA ATT 
Lys Lys He 
95 

TAC GCA CCA 
Tyr Ala Pro 



TTC TGG GTT 
Phe Trp Val 



TTA GAA TTC 
Leu Glu Phe 
135 

ATT TTC GTA 
He Phe Val 
150 

CTG .AAT TTG 
Leu Asn Leu 
165 

TTG GAT CAG 
Leu Asp Gin 

GAG AAA TTT 
Glu Lys Phe 



GTT TCT GCT 
Val Ser Ala 
205 

CAT ACA ACC 
His Thr Thr 
220 

GGA AGT GCT 
Gly Ser Ala 
235 

AAG CGT AGT 
Lys Arg Ser 



AAC AAT ACT 
Asn Asn Thr 



GGA AAT GTG 
Gly Asn Val 



GAA GGA GAC 
Glu Gly Asp 



AAA ACA ACT 
Lys Thr Thr 
110 

CAT GGT GGT 
His Gly Gly 
125 

CAA AGC CCA 
Gin Ser Pro 
140 

ACT TTC AAT 
Thr Phe Asn 



GAG TTG GAG 
Glu Leu Glu 



GTG GCA GCT 
Val Ala Ala 
180 

GGT GGA GAT 
Gly Gly Asp 
195 

GGT GGA GCA 
Gly Gly Ala 
210 

ACT GGA CTT 
Thr Gly Leu 



TTT AAT CCA 
Phe Asn Pro 



CTT CAA CTT 
Leu Gin Leu 
250 

CAA GAT GCT 
Gin Asp Ala 
265 
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294 

336 

378 

420 

462 

504 

546 

588 

630 

672 

714 

756 

798 



-190- 



TTA GAA 
Leu Glu 



AAA ATG 
5 Lys Met 



GTC TTC 
Val Phe 
295 

10 CCT TTC 
Pro Phe 
310 

TCC TTT 
Ser Phe 

15 

GAA GGA 
Glu Gly 



ATG CTG 
20 Met Leu 



GCC GAA 
Ala Glu 
365 

25 CTT GCA 
Leu Ala 
380 

GTT CCA 
Val Pro 

30 

ATT TGG 
lie Trp 



GTG GAG 
35 Val Glu 



TTC TCG 
Phe Ser 
435 



TTC TTA CAA 
Phe Leu Gin 
270 

CCA GCT GAA 
Pro Ala Glu 
285 

GTA CCA TCA 
Val Pro Ser 



TTG GAA GAA 
Leu Glu Glu 



AAC AAA GTA 
Asn Lys Val 
325 

CTT TTG TTC 
Leu Leu Phe 
340 

AAC CAA GCT 
Asn Gin Ala 
355 

TTT GAA TTA 
Phe Glu Leu 



GAA AAA ATC 
Glu Lys He 



GAA AAT GAA 
Glu Asn Glu 
395 

TTT ACT AGA 
Phe Thr Arg 
410 

AAA CAA GAC 
Lys Gin Asp 
425 

GAA AGT CAT 
Glu Ser His 



AAA GCC CCC 
Lys Ala Pro 



ACA GAA GGT 
Thr Glu Gly 



ATT GAA AAA 
He Glu Lys 
300 

TCA CCA TTG 
Ser Pro Leu 
315 

CCT TTA TTA 
Pro Leu Leu 
330 

AAA TTC TTC 
Lys Phe Phe 



GAA GCA GAT 
Glu Ala Asp 



GTC CAT GGA 
Val His Gly 
370 

AGG AAG TTT 
Arg Lys Phe 
385 

CAG AAA TTT 
Gin Lys Phe 
400 

GGT GTT GAC 
Gly Val Asp 



GAA CCA GTT 
Glu Pro Val 



CCT GCA AAA 
Pro Ala Lys 
440 



GTA GAC AGT 
Val Asp Ser 
275 

GAA ATA ATA 
Glu He He 
290 

GTT TTC CCA 
Val Phe Pro 
305 

GCC AGA ATG 
Ala Arg Met 



GTT GGA TTT 
Val Gly Phe 



ATG AAA GAA 
Met Lys Glu 
345 

TTT GAA AGA 
Phe Glu Arg 
360 

TCA GAG GAA 
Ser Glu Glu 
375 

TAC TTT GAC 
Tyr Phe Asp 



ATT GAC TTG 
He Asp Leu 



AAG CAT GTC 
Lys His Val 
415 

TAT TAT TAT 
Tyr Tyr Tyr 
430 

GGA ACA TTT 
Gly Thr Phe 
445 



CTC CTG AAG 
Leu Leu Lys 
280 

GAA GAG TTT 
Glu Glu Phe 



TCC CAC CAA 
Ser His Gin 



AAA TCC GGA 
Lys Ser Gly 
320 

AAC AGT GCA 
Asn Ser Ala 
335 

AAA CCA GAG 
Lys Pro Glu 
350 

CTC GTA CCA 
Leu Val Pro 



TCG AAA AAA 
Ser Lys Lys 



GAT AAA CCC 
Asp Lys Pro 
390 

ATA GGA GAT 
He Gly Asp 
405 

AAG TTG TCT 
Lys Leu Ser 
420 

GAA TAT TCC 
Glu Tyr Ser 



GGT GAT CAT 
Gly Asp His 
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.191- FC-1-C1-PCT 

AAT CTG ACT GGT GCA TGC CAT GGA GAA GAA CTT GTG AAT TTA 13 86 

Asn Leu Thr Gly Ala Cys His Gly Glu Glu Leu Val Asn Leu 
450 455 460 

TTC AAA GTC GAG ATG ATG AAG CTG GAA AAA GAT AAA CCT AAT 1428 
5 Phe Lys Val Glu Met Met Lys Leu Glu Lys Asp Lys Pro Asn 
465 470 475 

GTT CTA TTA ACA AAA GAT AGA GTA CTT GCC ATG TGG ACT AAC 147 0 

Val Leu Leu Thr Lys Asp Arg Val Leu Ala Met Trp Thr Asn 
480 485 490 

10 TTC ATC AAA AAT GGA AAT CCT ACT CCT GAA GTA ACA GAA TTA 1512 
Phe lie Lys Asn Gly Asn Pro Thr Pro Glu Val Thr Glu Leu 
495 500 

TTG CCA GTT AAA TGG GAA CCT GCC ACA AAA GAC AAG TTG AAT 15 54 

Leu Pro Val Lys Trp Glu Pro Ala Thr Lys Asp Lys Leu Asn 
15 505 510 515 

TAT TTG AAC ATT GAT GCC ACC TTA ACT TTG GGA ACA AAT CCT 15 96 

Tyr Leu Asn He Asp Ala Thr Leu Thr Leu Gly Thr Asn Pro 
520 525 530 

GAG GCA AAC CGA GTC AAA TTT TGG GAA GAC GCC ACA AAA TCT 163 8 

20 Glu Ala Asn Arg Val Lys Phe Trp Glu Asp Ala Thr Lys Ser 
535 540 545 

TTG CAC GGT CAA 1650 
Leu His Gly Gin 
550 

25 (2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

TTGACCGTGC AAAGATTTTG TGGCGTCTTC CCAAAATTTG ACTCGGTTTG 5 0 

CCTCAGGATT TGTTCCCAAA GTTAAGGTGG C /TCAATGTT CAAATAATTC 10 0 

35 AACTTGTCTT TTGTGGCAGG TTCCCATTT~ .CTGGCAATA ATTCTGTTAC 15 0 

TTCAGGAGTA GGATTTCCAT TTTTGATGAA GTTAGTCCAC ATGGCAAGTA 200 

CTCTATCTTT TGTTAATAGA ACATTAGGTT TATCTTTTTC CAGCTTCATC 2 50 

ATCTCGACTT TGAATAAATT CACAAGTTCT TCTCCATGGC ATGCACCAGT 3 00 

CAGATTATGA TCACCAAATG TTCCTTTTGC AGGATGACTT TCCGAGAAGG 3 50 

40 AATATTCATA ATAATAAACT GGTTCGTCTT GTTTCTCCAC AGACAACTTG 4 00 

ACATGCTTGT CAACACCTCT AGTAAACCAA ATATCTCCTA TCAAGTCAAT 450 

AAATTTCTGT TCATTTTCTG GAACGGGTTT ATCGTCAAAG TAAAACTTCC 500 

TGATTTTTTC TGCAAGTTTT TTCGATTCCT CTG ATC C ATG GACTAATTCA 550 

AATTCGGCTG GTACGAGTCT TTCAAAATCT GCTTCAGCTT GGTTCAGCAT 600 
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CTCTGGTTTT TCTTTCATGA AGAATTTGAA CAAAAGTCCT TCTGCACTGT 650 

TAAATCCAAC TAATAAAGGT ACTTTGTTAA AGGATCCGGA TTTCATTCTG 700 

GCCAATGGTG ATTCTTCCAA GAAAGGTTGG TGGGATGGGA AAACTTTTTC 750 

AATTGATGGT ACGAAGACAA ACTCTTCTAT TATTTCACCT TCTGTTTCAG 8 00 

5 CTGGCATTTT CTTCAGGAGA CTGTCTACGG GGGCTTTTTG TAAGAATTCT 850 

AAAGCATCTT GAGTATTGTT TGTGGGATGA CCCAATATCT C AG C AAGTTG 900 

AAGACTACGC TTTACTGGAT GTCTTTGGAA GGCCCATGGA TTAAAAG C AC 950 

TTCCACTTTG AGCAATTGCC CTTTTGTAAA GTCCAGTGGT TGTATGAGAT 1000 

AACAAAAGAT AATGAACACT TGCTCCACCA GCAGAAACAC CACCAATTGT 105 0 

10 AATATTTTCT GGATCTCCAC CAAATTTCTC AATGTTTTCT TTGGTCCATT 110 0 

TCAGAGCTGC CACCTGATCC AATAATCCTA CATTTCCTGG AGCACCCTCC 115 0 

AACTCCAAAT TCAGAAATCC GAGAGGTCCC AATCGGTAAT TGAAAGTTAC 12 0 0 

GAAAATAACA TCAAAATYTA CTAAATAATC TGGGCTTTGG AATTCTAAAT 12 5 0 

TTCCGGATCC AGTCACAAAA CCACCACCAT GAACCCAGAA AAATACTGGA 13 0 0 

15 AGTTTTTTAT CAGAAGTTGT TTTTGGTGCG TACACGTTTA CTACCAAGCA 13 50 

GTCTTCGTCT CCTTCAATTT TCTTCAAGAA GAAATTTAAA GATTTACACA 14 00 

CATTTCCGTA TAATGTGGCG TTGAAAACAC CTTGCCATGG CTCAGCTTTC 14 5 0 

TGTGGAGGCT TAAATCTAAG TTCTCCAACA GGAGGTTTAG CATAAGGTAC 15 0 0 

ACCTGTGTAG CTAAAATACT CTTTTCCATT TTCGTTCGTC AAAGCTTTTC 15 50 

20 CAACCAATTC GCCTTGGGGC AAAGTTACAG TCGGGGAATC ACATTTTATA 16 00 

AAATTAAAAC TAAACAAAAA AATACAACTT AAAAAAATAA CACGAGACAT 165 0 



(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 2 9 bases 

25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
30 AAACTCGAGT CCCCCGACTG TAACTTTGC 

(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 bases 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
TCATCTGCAG TTATTGACTG TGCAAAGTTT TTGTGG 
40 (2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



-193- 



FC-1-C1-PCT 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION : SEQ ID NO : 64 : 
TTCCGGATCC GGCTGATCTA CAAGTGACTT TG 32 

5 (2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO : 65 : 
TGGTACTCGA GTCATAAAAA TTTATTCCAA AATC 34 

(2) INFORMATION FOR SEQ ID NO: 66: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: primer 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:66: 
AAAACTGCAG TATAAATATG TTACCTCACA GTGCATTAG 3 9 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 1987 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 (ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1650 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

AATTCACAGT GTAAATAATT TTATTTGATA TAAATGTATT TAATTTTTAT 50 
35 TTTAATCTAA TTTTAATTTA AATATATATA GTTTTATTTA TAAAAAAATA 100 
TTTTTTTTAT GATCGAAAAG AAATTTTTAT TTATGTTTAT GAGTGTGTGT 15 0 



-194- FC-1-C1-PCT 

TTTGGCTATG ATTTACATTA TTTTTGAGCT AGTATAAAAT TAAACCATAT 20 0 

TATATTTTGG ATATATAATA ACATTTTATA ATG TGT GAT CCA TTA 24 5 

Met Cys Asp Pro Leu 
1 5 

5 CTA AAA ACA ACA ACA TAT GGA ATT CTG AAA GGC AAG AAA GTT 2 87 

Leu Lys Thr Thr Thr Tyr Gly He Leu Lys Gly Lys Lys Val 
10 15 

GTA AAC GAA AAT GGT AAA ATT TAC TAT AGT TAC ACA GGT ATA 32 9 
Val Asn Glu Asn Gly Lys He Tyr Tyr Ser Tyr Thr Gly He 
10 20 25 -30 

CCC TAT GCA AAA TCT CCT GTA AAT GAT CTC AGA TTC AAG CCA 3 71 

Pro Tyr Ala Lys Ser Pro Val Asn Asp Leu Arg Phe Lys Pro 
35 40 45 

CCA CAA AAA CTT GAT CCT TGG AAT GGT GTT TTT GAC GCC ACT 413 
15 Pro Gin Lys Leu Asp Pro Trp Asn Gly Val Phe Asp Ala Thr 
50 55 60 

CAG TAT GGA AAT AAT TGT GCT GCT GGG AAA TGG TTT TTG AAA 4 55 

Gin Tyr Gly Asn Asn Cys Ala Ala Gly Lys Trp Phe Leu Lys 
65 70 75 

20 TCA GCT GGG GGT TGC GAA GAT TGC CTT TAC TTA AAT ATC TAT 4 97 
Ser Ala Gly Gly Cys Glu Asp Cys Leu Tyr Leu Asn He Tyr 
80 85 

GTC CCA CAA AAC ACT TCA GAA AAT CCT TTG CCA GTA ATG TTT 53 9 

Val Pro Gin Asn Thr Ser Glu Asn Pro Leu Pro Val Met Phe 
25 90 95 100 

TGG ATT CAT GGA GGA GCA TTT GTG GTC GGA TCA GGA AAT TCT 581 
Trp He His Gly Gly Ala Phe Val Val Gly Ser Gly Asn Ser 
105 HO H5 

GAT ATA CAT GGT CCT GAT TAT TTA ATA GAA TAT GAT ATT ATC 62 3 

30 Asp He His Gly Pro Asp Tyr Leu He Glu Tyr Asp He He 
120 125 130 

TTA GTA ACT ATT AAT TAT CGT CTA GGA CCA CTT GGT TTT CTT 665 
Leu Val Thr He Asn Tyr Arg Leu Gly Pro Leu Gly Phe Leu 
135 140 145 

35 AAT TTG GAA ATC GAA GAT GCG CCT GGG AAT GTT GGA if, ATG 7 07 

Asn Leu Glu He Glu Asp Ala Pro Gly Asn Val Gly Leu Met 
150 155 

GAT CAA GTT GCA GCC CTA AAA TGG GTA AAT GAA AAT ATT GCA 74 9 

Asp Gin Val Ala Ala Leu Lys Trp Val Asn Glu Asn He Ala 
40 160 165 170 
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ACC TTT 
Thr Phe 
175 

ACT GCT 
5 Thr Ala 



CTT ACC 
Leu Thr 



10 AGT GCT 
Ser Ala 



AAT GCA 
Asn Ala 
15 230 

AAC CTT 
Asn Leu 
245 

GAA ACA 
20 Glu Thr 



CAA CTG 
Gin Leu 



25 ACA TTT 
Thr Phe 



CCA ATA 
Pro lie 
30 300 

ACA GGT 
Thr Gly 
315 

TTA AAA 
35 Leu Lys 



TTT GAA 
Phe Glu 



AGT GGA GAC 
Ser Gly Asp 



GGA GCT GCA 
Gly Ala Ala 
190 

AAA GGT TTA 
Lys Gly Leu 
205 

TTT AAT CCC 
Phe Asn Pro 
220 

CTT CGA CTA 
Leu Arg Leu 

CAA GAA GCC 
Gin Glu Ala 



TTG TTA AAT 
Leu Leu Asn 
260 

CTG GAT GAC 
Leu Asp Asp 
275 

CCA GAA CAA 
Pro Glu Gin 
290 

ATA AAT TCA 
lie Asn Ser 



TAC AAC AGT 
Tyr Asn Ser 



ACA GAT CCA 
Thr Asp Pro 
330 

AGA TTT ATA 
Arg Phe lie 
345 



CCA AAA AAT 
Pro Lys Asn 
180 

AGT GTA CAT 
Ser Val His 
195 

TTC CAC AAG 
Phe His Lys 



TGG GCT TTC 
Trp Ala Phe 



TGC AAA ACC 
Cys Lys Thr 
235 

TTG GAT TTT 
Leu Asp Phe 
250 

ACC AAA TTA 
Thr Lys Leu 
265 

TTC GTG TTT 
Phe Val Phe 



GAT TCG TAC 
Asp Ser Tyr 



GGA AAA TTC 
Gly Lys Phe 
305 

GCC GAA GGC 
Ala Glu Gly 
320 

GAT T\ TTA 
Asp j^e^- Leu 
335 

CCA ACT GAC 
Pro Thr Asp 



ATT ACA ATT 
lie Thr lie 



TAT CAC ATT 
Tyr His lie 



GCT ATA GCA 
Ala lie Ala 
210 

CAA AAA AAT 
Gin Lys Asn 
225 

TTA GGC CTT 
Leu Gly Leu 
240 

TTG AAA AAC 
Leu Lys Asn 

CCC CAA GAA 
Pro Gin Glu 



GTA CCT TCG 
Val Pro Ser 
280 

TTA ACT GAC 
Leu Thr Asp 
295 

CAC AAA GTT 
His Lys Val 
310 

AAT CTA TTT 
Asn Leu Phe 



AAT AAA TTT 
Asn Lys Phe 



TTA GAA TTA 
Leu Glu Leu 
350 



TGT GGA GCA 
Cys Gly Ala 
185 

TTG TCA CAA 
Leu Ser Gin 
200 

CAA AGT GGA 
Gin Ser Gly 
215 

CCT GTT AAG 
Pro Val Lys 



ACC ACA AAC 
Thr Thr Asn 



CTA CCA GTA 
Leu Pro Val 
255 

ATT GAT GGT 
lie Asp Gly 
270 

ATT GAA AAA 
lie Glu Lys 
285 

TTG CCA ATA 
Leu Pro lie 



CCA TTG TTG 
Pro Leu Leu 



TTC ATG TAC 
Phe Met Tyr 
325 

GAA GCT GAT 
Glu Ala Asp 
340 

CCT TTG CGA 
Pro Leu Arg 
355 



FC-1-C1-PCT 

791 



833 



875 



917 



959 



1001 



1043 



1085 



1127 



1169 



1211 



1253 



1295 
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TCA CAA AAA TCT ATT GCA CTG GGT GAA GCA ATC AGG GAA TTT 13 3 7 
Ser Gin Lys Ser lie Ala Leu Gly Glu Ala lie Arg Glu Phe 
360 365 

TAT TTC CAA AAC AAA ACC ATA TCA GAA AAT ATG CAG AAT TTT 13 7 9 
5 Tyr Phe Gin Asn Lys Thr lie Ser Glu Asn Met Gin Asn Phe 
370 375 380 

GTA GAT GTT TTA AGT GAT AAT TGG TTT ACA CGT GGA ATT GAT 14 21 
Val Asp Val Leu Ser Asp Asn Trp Phe Thr Arg Gly lie Asp 
385 390 395 

10 GAG CAA GTA AAG TTA ACT GTT AAA AAT CAG GAA GAA CCA GTT 14 63 
Glu Gin Val Lys Leu Thr Val Lys Asn Gin Glu Glu Pro Val 
400 405 410 

TTT TAT TAT GTT TAT AAT TTT GAT GAA AAT TCT CCA AGT CGG 15 05 
Phe Tyr Tyr Val Tyr Asn Phe Asp Glu Asn Ser Pro Ser Arg 
15 415 420 425 

AAA GTT TTT GGT GAT TTT GGA ATA AAA GGC GGT GGT CAT GCT 154 7 
Lys Val Phe Gly Asp Phe Gly lie Lys Gly Gly Gly His Ala 
430 435 

GAT GAA TTG GGT AAT ATA TTT AAA GCC AAA AGT GCA AAT TTT 15 8 9 
20 Asp Glu Leu Gly Asn lie Phe Lys Ala Lys Ser Ala Asn Phe 
440 445 450 

GGG AAG GAA ACA CCA AAT GCT GTG TTG GTT CAG AGA AGG ATG 1631 
Gly Lys Glu Thr Pro Asn Ala Val Leu Val Gin Arg Arg Met 
455 460 465 

25 CTG GAG ATG TGG ACT AAT TTT GCT AAA TTT GGA AAT CCT ACT 1673 
Leu Glu Met Trp Thr Asn Phe Ala Lys Phe Gly Asn Pro Thr 
470 475 480 

CCA GCT ATT ACG GAT ACA CTT CCA ATA AAA TGG GAA CCT GCT 1715 
Pro Ala lie Thr Asp Thr Leu Pro lie Lys Trp Glu Pro Ala 
30 485 490 495 

TTT AAA GAA AAT ATG ACT TTT GTT CAA ATT GAC ATT GAT TTA 1757 
Phe Lys Glu Asn Met Thr Phe Val Gin lie Asp lie Asp Leu 
500 505 

AA r : TTG AGT ACT GAT CCA CTA AAA AGT CGT ATG GAA TTT GGG 1799 
35 a- n Leu Ser Thr Asp Pro Leu Lys Ser Arg Met Glu Phe Gly 
510 515 520 

AAT AAA ATA AAA TTA TTA AAA . TAAGTAACTA TACTTAGCTA 184 0 

Asn Lys lie Lys Leu Leu Lys 
525 530 

40 AAC CAT AAT A TACCAAATAA TAGTATAGGA ATACTTCACA ATTTTTTGTT 18 90 

ACTTCGTTAA GTAAATTTAA TTTTTTATAA AACCAACTTT TACGAATAAA 194 0 
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FC-1-C1-PCT 



AAATGTAATT ATTTTGGAAA AAAAAAAGAA AAAAAAAAAA AAAAAAC 



1987 



(2) 



INFORMATION FOR SEQ ID NO: 68: 



5 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Met Cys Asp Pro Leu Leu Lys Thr Thr Thr Tyr Gly lie Leu 
10 1 ' ~ 5 10 

Lys Gly Lys Lys Val Val Asn Glu Asn Gly Lys lie Tyr Tyr 
15 20 25 

Ser Tyr Thr Gly lie Pro Tyr Ala Lys Ser Pro Val Asn Asp 
30 35 40 

15 Leu Arg Phe Lys Pro Pro Gin Lys Leu Asp Pro Trp Asn Gly 
45 50 55 

Val Phe Asp Ala Thr Gin Tyr Gly Asn Asn Cys Ala Ala Gly 
60 65 70 

Lys Trp Phe Leu Lys Ser Ala Gly Gly Cys Glu Asp Cys Leu 
20 * 75 80 

Tyr Leu Asn lie Tyr Val Pro Gin Asn Thr Ser Glu Asn Pro 
85 90 95 

Leu Pro Val Met Phe Trp lie His Gly Gly Ala Phe Val Val 



25 Gly Ser Gly Asn Ser Asp lie His Gly Pro Asp Tyr Leu lie 

115 120 125 

Glu Tyr Asp He lie Leu Val Thr lie Asn Tyr Arg Leu Gly 

130 135 140 

Pro Leu Gly Phe Leu Asn Leu Glu lie Glu Asp Ala Pro Gly 
30 145 150 

Asn Val Gly Leu Met Asp Gin Val Ala Ala Leu Lys Trp Val 
155 160 165 

Asn Glu Asn lie Ala Thr Phe Ser Gly Asp Pro Lys Asn lie 



100 



105 



110 



170 



175 



180 



35 Thr lie Cys Gly Ala Thr Ala Gly Ala Ala Ser Val His Tyr 
185 190 195 
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FC-1-C1-PCT 



His lie Leu Ser Gin Leu Thr Lys Gly Leu Phe His Lys Ala 
200 205 210 

lie Ala Gin Ser Gly Ser Ala Phe Asn Pro Trp Ala Phe Gin 
215 220 

5 Lys Asn Pro Val Lys Asn Ala Leu Arg Leu Cys Lys Thr Leu 
225 230 235 

Gly Leu Thr Thr Asn Asn Leu Gin Glu Ala Leu Asp Phe Leu 
240 245 250 

Lys Asn Leu Pro Val Glu Thr Leu Leu Asn Thr Lys Leu Pro 
10 255 260 265 

Gin Glu lie Asp Gly Gin Leu Leu Asp Asp Phe Val Phe Val 
270 275 280 

Pro Ser lie Glu Lys Thr Phe Pro Glu Gin Asp Ser Tyr Leu 
285 290 

15 Thr Asp Leu Pro lie Pro lie lie Asn Ser Gly Lys Phe His 
295 300 305 

Lys Val Pro Leu Leu Thr Gly Tyr Asn Ser Ala Glu Gly Asn 
310 315 320 

Leu Phe Phe Met Tyr Leu Lys Thr Asp Pro Asp Leu Leu Asn 
20 325 330 335 

Lys Phe Glu Ala Asp Phe Glu Arg Phe lie Pro Thr Asp Leu 
340 345 350 

Glu Leu Pro Leu Arg Ser Gin Lys Ser lie Ala Leu Gly Glu 
355 360 

25 Ala lie Arg Glu Phe Tyr Phe Gin Asn Lys Thr lie Ser Glu 
365 370 375 

Asn Met Gin Asn Phe Val Asp Val Leu Ser Asp Asn Trp Phe 
380 385 390 

Thr Arg Gly lie Asp Glu Gin Val Lys Leu Thr Val Lys Asn 
30 395 400 405 

Gin Glu Glu Pro Val Phe Tyr Tyr Val Tyr A^n Phe Asp Glu 
410 415 420 

Asn Ser Pro Ser Arg Lys Val Phe Gly Asp Phe Gly lie Lys 
425 430 



35 Gly Gly Gly His Ala Asp Glu Leu Gly Asn lie Phe Lys Ala 
435 440 445 
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FC-1-C1-PCT 



Lys Ser Ala Asn Phe Gly Lys Glu Thr Pro Asn Ala Val Leu 
450 455 460 

Val Gin Arg Arg Met Leu Glu Met Trp Thr Asn Phe Ala Lys 
465 470 475 

5 Phe Gly Asn Pro Thr Pro Ala lie Thr Asp Thr Leu Pro lie 
480 485 490 

Lys Trp Glu Pro Ala Phe Lys Glu Asn Met Thr Phe Val Gin 



Arg Met Glu Phe Gly Asn Lys lie Lys Leu Leu Lys 
520 525 530 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

15 (A) LENGTH: 1987 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GTTTTTTTTT TTTTTTTTTC TTTTTTTTTT CCAAAATAAT TACATTTTTT 5 0 

ATTCGTAAAA GTTGGTTTTA TAAAAAATTA AATTTACTTA ACGAAGTAAC 10 0 

AAAAAATTGT GAAGTATTCC TATACTATTA TTTGGTATAT TATGGTTTAG 15 0 

CTAAGTATAG TTACTTATTT TAATAATTTT ATTTTATTCC CAAATTCCAT 2 00 

25 ACGACTTTTT AGTGGATCAG TACTCAAATT TAAATCAATG TCAATTTGAA 2 50 

CAAAAGTCAT ATTTTCTTTA AAAGCAGGTT CCCATTTTAT TGGAAGTGTA 3 00 

TCCGTAATAG CTGGAGTAGG ATTTCCAAAT TTAGCAAAAT TAGTCCACAT 3 50 

CTCCAGCATC CTTCTCTGAA CCAACACAGC ATTTGGTGTT TCCTTCCCAA 4 00 

AATTTGCACT TTTGGCTTTA AATATATTAC CCAATTCATC AGCATGACCA 45 0 

30 CCGCCTTTTA TTCCAAAATC ACCAAAAACT TTCCGACTTG GAGAATTTTC 500 

ATCAAAATTA TAAACATAAT AAAAAACTGG TTCTTCCTGA TTTTTAACAG 55 0 

TTAAC TTT AC TTGCTCATCA ATTCCACGTG TAAACCAATT ATCACTTAAA 60 0 

ACATCTACAA AATTCTGCAT ATTTTCTGAT ATGGTTTTGT TTTGGAAATA 650 

AAATTCCCTG ATTGCTTCAC CCAGTGCAAT AGATTTTTGT GATCG CAAAG 7 00 

35 GTAATTCTAA GTCAGTTG )T ATAAATCTTT CAAAATCAGC TTCAAATTTA 75 0 

TTTAATAAAT CTGGA'j. J*. . 3T TTTTAAGTAC ATGAAAAATA GATTGCCTTC 80 0 

GGCACTGTTG TAACCTGTCA ACAATGGAAC TTTGTGGAAT TTTCCTGAAT 850 

TTATTATTGG TATTGGCAAG TCAGTTAAGT ACGAATCTTG TTCTGGAAAT 900 

GTTTTTTCAA TCGAAGGTAC AAACACGAAG TCATCCAGCA GTTGACCATC 950 

40 AATTTCTTGG GGTAATTTGG TATTTAACAA TGTTTCTACT GGTAGGTTTT 10 0 0 

TCAAAAAATC CAAGGCTTCT TGAAGGTTGT TTGTGGTAAG GCCTAAGGTT 10 5 0 

TTGCATAGTC GAAGTGCATT CTTAACAGGA TTTTTTTGGA AAGCCCAGGG 1100 

ATT AAAAG C A CTTCCACTTT GTGCTATAGC CTTGTGGAAT AAACCTTTGG 115 0 

TAAGTTGTGA CAAAATGTGA TAATGTACAC TTGCAGCTCC AGCAGTTGCT 12 0 0 
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FC-1-C1-PCT 



CCACAAATTG TAATATTTTT TGGGTCTCCA CTAAAGGTTG CAATATTTTC 1250 

ATTTACCCAT TTTAGGGCTG CAACTTGATC CATCAATCCA ACATTCCCAG 13 00 

GCGCATCTTC GATTTCCAAA TTAAGAAAAC CAAGTGGTCC TAGACGATAA 1350 

TTAATAGTTA CTAAGATAAT ATCATATTCT ATTAAATAAT CAGGACCATG .14 00 

5 TATATCAGAA TTTCCTGATC CGACCACAAA TGCTCCTCCA TGAATCCAAA 14 50 

ACATTACTGG CAAAGGATTT TCTGAAGTGT TTTGTGGGAC ATAGATATTT 1500 

AAGTAAAGGC AATCTTCGCA ACCCCCAGCT GATTTCAAAA ACCATTTCCC 15 5 0 

AGCAGCACAA TTATTTCCAT ACTGAGTGGC GTCAAAAACA CCATTCCAAG 1600 

GATCAAGTTT TTGTGGTGGC TTGAATCTGA GATCATTTAC AGGAGATTTT 165 0 

10 GCATAGGGTA TACCTGTGTA ACTATAGTAA ATTTTACCAT TTTCGTTTAC 17 00 

AACTTTCTTG CCTTTCAGAA TTCCATATGT TGTTGTTTTT AGTAATGGAT 17 5 0 

CACACATTAT AAAATGTTAT TATATATCCA AAATATAATA TGGTTTAATT 1800 

TTATACTAGC TCAAAAATAA TGTAAATCAT AGCCAAAACA CACACTCATA 1850 

AACATAAATA AAAATTTCTT TTCGATCATA AAAAAAATAT TTTTTTATAA 1900 

15 ATAAAACTAT ATATATTTAA ATTAAAATTA GATTAAAATA AAAATTAAAT 195 0 

ACATTTATAT CAAATAAAAT TATTTACACT GTGAATT 19 87 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1590 nucleotides 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
25 (A) NAME /KEY : CDS 

(B) LOCATION: 1..1590 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ATG TGT GAT CCA TTA CTA AAA ACA ACA ACA TAT GGA ATT CTG 4 2 

Met Cys Asp Pro Leu Leu Lys Thr Thr Thr Tyr Gly lie Leu 
30 1 5 10 

AAA GGC AAG AAA GTT GTA AAC GAA AAT GGT AAA ATT TAC TAT 84 
Lys Gly Lys Lys Val Val Asn Glu Asn Gly Lys lie Tyr Tyr 
15 20 25 

AGT TAC ACA GGT ATA CCC TAT GCA AAA TCT CCT GTA AAT GAT 12 6 

35 Ser Tyr Thr Gly lie Pro Tyr Ala Lys Ser Pro Val Asn Asp 
30 35 40 

CTC AGA TTC AAG CCA CCA CAA AAA CTT GAT CCT TGG AAT GGT 168 
Leu Arg Phe Lys Pro Pro Gin Lys Leu Asp Pro Trp Asn Gly 
45 50 55 

40 GTT TTT GAC GCC ACT CAG TAT GGA AAT AAT TGT GCT GCT GGG 210 
Val Phe Asp Ala Thr Gin Tyr Gly Asn Asn Cys Ala Ala Gly 
60 65 70 
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AAA TGG 
Lys Trp 



TAC TTA 
5 Tyr Leu 
85 

TTG CCA 
Leu Pro 
100 

10 GGA TCA 
Gly Ser 



GAA TAT 
Glu Tyr 

15 

CCA CTT 
Pro Leu 



AAT GTT 
20 Asn Val 
155 

AAT GAA 
Asn Glu 
170 

25 ACA ATT 
Thr lie 



CAC ATT 
His He 

30 

ATA GCA 
He Ala 



AAA AAT 
35 Lys Asn 
225 

GGC CTT 
Gly Leu 
240 



TTT TTG AAA 
Phe Leu Lys 
75 

AAT ATC TAT 
Asn He Tyr 



GTA ATG TTT 
Val Met Phe 



GGA AAT TCT 
Gly Asn Ser 
115 

GAT ATT ATC 
Asp He He 
130 

GGT TTT CTT 
Gly Phe Leu 
145 

GGA TTG ATG 
Gly Leu Met 



AAT ATT GCA 
Asn He Ala 



TGT GGA GCA 
Cys Gly Ala 
185 

TTG TCA CAA 
Leu Ser Gin 
200 

CAA AGT GGA 
Gin Ser Gly 
215 

CCT GTT AAG 
Pro Val Lys 



ACC ACA AAC 
Thr Thr Asn 



TCA GCT GGG 
Ser Ala Gly 



GTC CCA CAA 
Val Pro Gin 
90 

TGG ATT CAT 
Trp He His 
105 

GAT ATA CAT 
Asp He His 
120 

TTA GTA ACT 
Leu Val Thr 



AAT TTG GAA 
Asn Leu Glu 



GAT CAA GTT 
Asp Gin Val 
160 

ACC TTT AGT 
Thr Phe Ser 
175 

ACT GCT GGA 
Thr Ala Gly 
190 

CTT ACC AAA 
Leu Thr Lys 

AGT GCT TTT 
Ser Ala Phe 



AAT GCA CTT 
Asn Ala Leu 
230 

AAC CTT CAA 
Asn Leu Gin 
245 



GGT TGC GAA 
Gly Cys Glu 
80 

AAC ACT TCA 
Asn Thr Ser 
95 

GGA GGA GCA 
Gly Gly Ala 

GGT CCT GAT 
Gly Pro Asp 



ATT AAT TAT 
He Asn Tyr 
135 

ATC GAA GAT 
He Glu Asp 
150 

GCA GCC CTA 
Ala Ala Leu 
165 

GGA GAC CCA 
Gly Asp Pro 



GCT GCA AGT 
Ala Ala Ser 



GGT TTA TTC 
Gly Leu Phe 
205 

AAT CCC TGG 
Asn Pro Trp 
220 

CGA CTA TGC 
Arg Leu Cys 
235 

GAA GCC TTG 
Glu Ala Leu 



GAT TGC CTT 
Asp Cys Leu 



GAA AAT CCT 
Glu Asn Pro 



TTT GTG GTC 
Phe Val Val 
110 

TAT TTA ATA 
Tyr Leu He 
125 

CGT CTA GGA 
Arg Leu Gly 
140 

GCG CCT GGG 
Ala Pro Gly 



AAA TGG GTA 
Lys Trp Val 



AAA AAT ATT 
Lys Asn He 
180 

GTA CAT TAT 
Val His Tyr 
195 

CAC AAG GCT 
His Lys Ala 
210 

GCT TTC CAA 
Ala Phe Gin 



AAA ACC TTA 
Lys Thr Leu 

GAT TTT TTG 
Asp Phe Leu 
250 



FC-1-C1-PCT 

252 



294 



336 



378 



420 



462 



504 



546 



588 



630 



672 



71 



756 
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AAA AAC 
Lys Asn 



CAA GAA 
5 Gin Glu 



CCT TCG 
Pro Ser 



10 ACT GAC 
Thr Asp 
295 

AAA GTT 
Lys Val 
15 310 

CTA TTT 
Leu Phe 



AAA TTT 
20 Lys Phe 



GAA TTA 
Glu Leu 



25 GCA ATC 
Ala lie 
365 

AAT ATG 
Asn Met 
30 380 

ACA CGT 
Thr Arg 



CAG GAA 
35 Gin Glu 



AAT TCT 
Asn Ser 



CTA CCA GTA 
Leu Pro Val 
255' 

ATT GAT GGT 
lie Asp Gly 
270 

AtT GAA AAA 
lie Glu Lys 
285 

TTG CCA ATA 
Leu Pro lie 



CCA TTG TTG 
Pro Leu Leu 



TTC ATG TAG 
Phe Met Tyr 
325 

GAA GCT GAT 
Glu Ala Asp 
340 

CCT TTG CGA 
Pro Leu Arg 
355 

AGG GAA TTT 
Arg Glu Phe 



CAG AAT TTT 
Gin Asn Phe 



GGA ATT GAT 
Gly He Asp 
395 

GAA CCA GTT 
Glu Pro Val 
410 

CCA AGT CGG 
Pro Ser Arg 
425 



GAA ACA TTG 
Glu Thr Leu 
260 

CAA CTG CTG 
Gin Leu Leu 



ACA TTT CCA 
Thr Phe Pro 



CCA ATA ATA 

Pro He He 
300 

ACA GGT TAC 
Thr Gly Tyr 
315 

TTA AAA ACA 
Leu Lys Thr 
330 

TTT GAA AGA 
Phe Glu Arg 



TCA CAA AAA 
Ser Gin Lys 



TAT TTC CAA 
Tyr Phe Gin 
370 

GTA GAT GTT 
Val Asp Val 
385 

GAG CAA GTA 
Glu Gin Val 
400 

TTT TAT TAT 
Phe Tyr Tyr 



AAA GTT TTT 
Lys Val Phe 



TTA AAT ACC 
Leu Asn Thr 



GAT GAC TTC 
Asp Asp Phe 
275 

GAA CAA GAT 
Glu Gin Asp 
290 

AAT TCA GGA 
Asn Ser Gly 
305 

AAC AGT GCC 
Asn Ser Ala 



GAT CCA GAT 
Asp Pro Asp 



TTT ATA CCA 
Phe He Pro 
345 

TCT ATT GCA 
Ser He Ala 
360 

AAC AAA ACC 
Asn Lys Thr 
375 

TTA AGT GAT 
Leu Ser Asp 



AAG TTA ACT 
Lys Leu Thr 



GTT 7 AT AAT 
Va A .yr Asn 
415 

GGT GAT TTT 
Gly Asp Phe 
430 



AAA TTA CCC 
Lys Leu Pro 
265 

GTG TTT GTA 
Val Phe Val 
280 

TCG TAC TTA 
Ser Tyr Leu 



AAA TTC CAC 
Lys Phe His 



GAA GGC AAT 
Glu Gly Asn 
320 

TTA TTA AAT 
Leu Leu Asn 
335 

ACT GAC TTA 
Thr Asp Leu 
350 

CTG GGT GAA 
Leu Gly Glu 



ATA TCA GAA 
He Ser Glu 



AAT TGG TTT 
Asn Trp Phe 
390 

GTT AAA AAT 
Val Lys Asn 
405 

TTT GAT GAA 
Phe Asp Glu 
420 

GGA ATA AAA 
Gly He Lys 



FC-1-C1-PCT 

798 



840 



882 



924 



966 



1008 



1050 



1092 



1134 



1176 



1218 



1260 



1302 



-203- FC-1-C1-PCT 



GGC GGT GGT CAT GCT GAT GAA TTG GGT AAT ATA TTT AAA GCC 1344 

Gly Gly Gly His Ala Asp Glu Leu Gly Asn He Phe Lys Ala 
435 440 445 

AAA AGT GCA AAT TTT GGG AAG GAA ACA CCA AAT GCT GTG TTG 13 86 

5 Lys Ser Ala Asn Phe Gly Lys Glu Thr Pro Asn Ala Val Leu 
450 455 460 

GTT CAG AGA AGG ATG CTG GAG ATG TGG ACT AAT TTT GCT AAA 14 2 8 
Val Gin Arg Arg Met Leu Glu Met Trp Thr Asn Phe Ala Lys 
465 470 475 

10 TTT GGA AAT CCT ACT CCA GCT ATT ACG GAT ACA CTT CCA ATA 14 7 0 
Phe Gly Asn Pro Thr Pro Ala He Thr Asp Thr Leu Pro He 
480 485 490 

AAA TGG GAA CCT GCT TTT AAA GAA AAT ATG ACT TTT GTT CAA 1512 
Lys Trp Glu Pro Ala Phe Lys Glu Asn Met Thr Phe Val Gin 
15 495 500 

ATT GAC ATT GAT TTA AAT TTG AGT ACT GAT CCA CTA AAA AGT 1554 
He Asp He Asp Leu Asn Leu Ser Thr Asp Pro Leu Lys Ser 
505 510 515 

CGT ATG GAA TTT GGG AAT AAA ATA AAA TTA TTA AAA 159 0 

20 Arg Met Glu Phe Gly Asn Lys He Lys Leu Leu Lys 
520 525 530 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1590 nucleotides 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

30 TTTTAATAAT TTTATTTTAT TCCCAAATTC CATACGACTT TTTAGTGGAT 5 0 

CAGTACTCAA ATTTAAATCA ATGTCAATTT GAACAAAAGT CATATTTTCT 10 0 

TTAAAAGCAG GTTCCCATTT TATTGGAAGT GTATCCGTAA TAGCTGGAGT 15 0 

AGGATTTCC" 7 P TTTAGCAA AATTAGTCCA CATCTCCAGC ATCCTTCTCT 200 

GAACCAACAC A JCATTTGGT GTTTCCTTCC CAAAATTTGC ACTTTTGGCT 2 50 

35 TTAAATATAT TACCCAATTC ATCAGCATGA CCACCGCCTT TTATTC C AAA 300 

ATCACCAAAA ACTTTCCGAC TTGGAGAATT TTCATCAAAA TTATAAACAT 350 

AATAAAAAAC TGGTTCTTCC TGATTTTTAA CAGTTAACTT TACTTGCTCA 4 00 

TCAATTCCAC GTGTAAACCA ATTATCACTT AAAACATCTA CAAAATTCTG 450 

CATATTTTCT GATATGGTTT TGTTTTGGAA ATAAAATTCC CTGATTGCTT 500 

40 CACCCAGTGC AATAGATTTT TGTGATCGCA AAGGTAATTC TAAGTCAGTT 550 

GGTATAAATC TTTCAAAATC AGCTTCAAAT TTATTTAATA AATCTGGATC 60 0 

TGTTTTTAAG TACATGAAAA ATAGATTGCC TTCGGCACTG TTGTAACCTG 650 
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TCAACAATGG AACTTTGTGG AATTTTCCTG AATTTATTAT TGGTATTGGC 7 00 

AAGTCAGTTA AGTACGAATC TTGTTCTGGA AATGTTTTTT CAATCGAAGG 750 

TACAAACACG AAGTCATCCA GCAGTTGACC ATCAATTTCT TGGGGTAATT 800 

TGGTATTTAA CAATGTTTCT ACTGGTAGGT TTTTCAAAAA ATCCAAGGCT 850 

5 TCTTGAAGGT TGTTTGTGGT AAGGCCTAAG GTTTTG C ATA GTCGAAGTGC 900 

ATTCTTAACA GGATTTTTTT GGAAAGCCCA GGGATTAAAA GCACTTCCAC 95 0 

TTTGTGCTAT AGCCTTGTGG AATAAACCTT TGGTAAGTTG TGACAAAATG 1000 

TGATAATGTA CACTTGCAGC TCCAGCAGTT GCTCCACAAA TTGTAATATT 1050 

TTTTGGGTCT CCACTAAAGG TTGCAATATT TTCATTTACC CATTTTAGGG 1100 

10 CTGCAACTTG ATCCATCAAT CCAACATTCC CAGGCGCATC TTCGATTTCC 1150 

AAATTAAGAA AACCAAGTGG TCCTAGACGA TAATTAATAG TTACTAAGAT 12 00 

AATATCATAT TCTATTAAAT AATCAGGACC ATGTATATCA GAATTTCCTG 12 50 

ATCCGACCAC AAATGCTCCT CCATGAATCC AAAACATTAC TGGCAAAGGA 13 00 

TTTTCTGAAG TGTTTTGTGG GACATAGATA TTTAAGTAAA GGCAATCTTC 1350 

15 GCAACCCCCA GCTGATTTCA AAAACCATTT CCCAGCAGCA CAATTATTTC 14 0 0 

CATACTGAGT GGCGTCAAAA ACACCATTCC AAGGATCAAG TTTTTGTGGT 14 50 

GGCTTGAATC TGAGATCATT TACAGGAGAT TTTGCATAGG GTATACCTGT 150 0 

GTAACTATAG TAAATTTTAC CATTTTCGTT TACAACTTTC TTGCCTTTCA 155 0 

G AATT C CAT A TGTTGTTGTT TTTAGTAATG GATCACACAT 15 9 0 



20 (2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 0 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 3. .650 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

GG ATC CAT GGA GGC GCA TTC AAC CAA GGA TCA GGA TCT TAT 41 
lie His Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr 
15 10 

AAT TTT TTT GGA CCT GAT TAT TTG ATC AGG GAA GGA ATT ATT 8 3 

35 Asn Phe Phe Gly Pro Asp Tyr Leu lie Arg Glu Gly lie lie 
15 20 25 

TTG GTC ACT ATC AAC TAT AGA TTA GGA GTT TTC GGT TTT CTA 12 5 

Leu Val Thr lie Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu 
30 35 40 

40 TCA GCG CCG GAA TGG GAT ATC CAT GGA AAT ATG GGT CTA AAA 167 
Ser Ala Pro Glu Trp Asp lie His Gly Asn Met Gly Leu Lys 
45 50 55 
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GAC CAG AGA TTG GCA CTA AAA TGG GTT TAC GAC AAC ATC GAA 2 09 

Asp Gin Arg Leu Ala Leu Lys Trp Val Tyr Asp Asn lie Glu 
60 65 

AAG TTT GGT GGA GAC AGA GAA AAA ATT ACA ATT GCT GGA GAA 251 
5 Lys Phe Gly Gly Asp Arg Glu Lys lie Thr lie Ala Gly Glu 
70 75 80 

TCT GCT GGA GCA GCA AGT GTC CAT TTT CTG ATG ATG GAC AAC 2 93 

Ser Ala Gly Ala Ala Ser Val His Phe Leu Met Met Asp Asn 
85 90 95 

10 TCG ACT AGA AAA TAC TAC CAA AGG GCC ATT TTG CAG AGT GGG 33 5 

Ser Thr Arg Lys Tyr Tyr Gin Arg Ala lie Leu Gin Ser Gly 
100 105 110 

ACA TTA CTA AAT CCG ACT GCT AAT CAA ATT CAA CTT CTG CAT 3 77 

Thr Leu Leu Asn Pro Thr Ala Asn Gin lie Gin Leu Leu His 
15 115 120 125 

AGA TTT GAA AAA CTC AAA CAA GTG CTA AAC ATC ACG CAA AAA 419 
Arg Phe Glu Lys Leu Lys Gin Val Leu Asn lie Thr Gin Lys 
130 135 

CAA GAA CTC CTA AAC CTG GAT AAA AAC CTA ATT TTA CGA GCA 4 61 

20 Gin Glu Leu Leu Asn Leu Asp Lys Asn Leu lie Leu Arg Ala 

140 145 150 

GCC TTA AAC AGA GTT CCT GAT AGC AAC GAC CAT GAC CGA GAC 503 

Ala Leu Asn Arg Val Pro Asp Ser Asn Asp His Asp Arg Asp 
155 160 165 

25 ACA GTA CCA GTA TTT AAT CCA GTC TTA GAA TCA CCA GAA TCT 54 5 

Thr Val Pro Val Phe Asn Pro Val Leu Glu Ser Pro Glu Ser 
170 175 180 

CCA GAT CCA ATA ACA TTT CCA TCT GCC TTG GAA AGA ATG AGA 5 87 

Pro Asp Pro lie Thr Phe Pro Ser Ala Leu Glu Arg Met Arg 
30 185 190 195 

AAT GGT GAA TTT CCT GAT GTC GAT GTC ATC ATT GGT TTC AAT 62 9 

Asn Gly Glu Phe Pro Asp Val Asp Val lie lie Gly Phe Asn 
200 205 

AGT GCT GAA GGT TTA AGA TCT 65 0 

35 Ser Ala Glu Gly Leu Arg Ser 
210 215 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 nucleotides 
40 (B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

lie His Gly Gly Ala Phe Asn Gin Gly Ser Gly Ser Tyr Asn 
5 1 5 10 

Phe Phe Gly Pro Asp Tyr Leu lie Arg Glu Gly lie lie Leu 
15 20 25 

Val Thr He Asn Tyr Arg Leu Gly Val Phe Gly Phe Leu Ser 
30 35 40 

10 Ala Pro Glu Trp Asp He His Gly Asn Met Gly Leu Lys Asp 
45 50 55 

Gin Arg Leu Ala Leu Lys Trp Val Tyr Asp Asn He Glu Lys 
60 65 70 

Phe Gly Gly Asp Arg Glu Lys He Thr He Ala Gly Glu Ser 
15 75 80 

Ala Gly Ala Ala Ser Val His Phe Leu Met Met Asp Asn Ser 
85 90 95 

Thr Arg Lys Tyr Tyr Gin Arg Ala He Leu Gin Ser Gly Thr 
100 105 110 

20 Leu Leu Asn Pro Thr Ala Asn Gin He Gin Leu Leu His Arg 
115 120 125 

Phe Glu Lys Leu Lys Gin Val Leu Asn He Thr Gin Lys Gin 
130 135 140 

Glu Leu Leu Asn Leu Asp Lys Asn Leu He Leu Arg Ala Ala 
25 145 150 

Leu Asn Arg Val Pro Asp Ser Asn Asp His Asp Arg Asp Thr 
155 160 165 

Val Pro Val Phe Asn Pro Val Leu Glu Ser Pro Glu Ser Pro 
170 175 180 

30 Asp Pro He Thr Phe Pro Ser A_a Leu Glu Arg Met Arg Asn 
185 190 195 

Gly Glu Phe Pro Asp Val Asp Val He He Gly Phe Asn Ser 
200 205 210 

Ala Glu Gly Leu Arg Ser 
35 215 
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(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) 
(ix) 



MOLECULE TYPE: peptide 



FEATURE : 

(A) NAME/ KEY 

(B) LOCATION 

(C) NAME/ KEY 

(D) LOCATION 

(E) NAME/KRY 

(F) LOCATION 



Xxn = Tyr or Gly 
3 

Xxn = Lys or Tyr or Gly 
5 

Xxn = Val or Gin or Asn 
6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Asp Leu Xxn Val Xxn Xxn Leu Gin Gly Thr Leu Lys Gly Lys 
15 10 



Glu 
15 



(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



CGCGGATCCG CTGATCTACA AGTGACTTTG C 



(2) INFORMATION FOR SEQ ID NO: 76: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 3.. 1487 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CC CAG GGC GAA TTG GTT GGA AAA GCT TTG ACG AAC GAA AAT GGA 44 
Gin Gly Glu Leu Val Gly Lys Ala Leu Thr Asn Glu Asn Gly 
15 10 

5 

AAA GAG TAT TTT AGC TAC ACA GGT GTG CCT TAT GCT AAA CCT 8 6 

Lys Glu Tyr Phe Ser Tyr Thr Gly Val Pro Tyr Ala Lys Pro 
15 20 25 

10 CCA GTT GGA GAA CTT AGA TTT AAG CCT CCA CAG AAA GCT GAG 12 8 

Pro Val Gly Glu Leu Arg Phe Lys Pro Pro Gin Lys Ala Glu 
30 35 40 

CCA TGG AAT GGT GTT TTC AAC GCC ACA TCA CAT GGA AAT GTG 17 0 

15 Pro Trp Asn Gly Val Phe Asn Ala Thr Ser His Gly Asn Val 
45 50 55 

TGC AAA GCT TTG AAT TTC TTC TTG AAA AAA ATT GAA GGA GAC 212 
Cys Lys Ala Leu Asn Phe Phe Leu Lys Lys lie Glu Gly Asp 
20 60 65 70 

GAA GAC TGC TTG TTG GTG AAT GTG TAC GCA CCA AAA ACA ACT 2 54 

Glu Asp Cys Leu Leu Val Asn Val Tyr Ala Pro Lys Thr Thr 
75 80 

25 

TCT GAC AAA AAA CTT CCA GTA TTT TTC TGG GTT CAT GGT GGC 2 96 

Ser Asp Lys Lys Leu Pro Val Phe Phe Trp Val His Gly Gly 
85 90 95 

30 GGT TTT GTG ACT GGA TCC GGA AAT TTA GAA TTT CAA AGC CCA 33 8 

Gly Phe Val Thr Gly Ser Gly Asn Leu Glu Phe Gin Ser Pro 
100 105 110 

GAT TAT TTA GTA AAT TAT GAT GTT ATT TTT GTA ACT TTC AAT 3 80 

35 Asp Tyr Leu Val Asn Tyr Asp Val lie Phe Val Thr Phe Asn 
115 120 125 

TAC CGA TTG GGA CCA CTC GGA TTT TTG AAT TTG GAG TTG GAA 422 
Tyr Arg Leu Gly Pro Leu Gly Phe Leu Asn Leu Glu Leu Glu 
40 130 135 140 

GGT GCT CCT GGA AAT GTA GGA TTA TTG GAT CAG GTA GCA GCT 4 64 

Gly Ala Pro Gly Asn Val Gly Leu Leu Asp Gin Val Ala Ala 
145 150 

45 

TTG AAA TGG ACC AAA GAA AAT ATT GAG AAA TTT GGT GGA GAT 506 
Leu Lys Trp Thr Lys Glu Asn lie Glu Lys Phe Gly Gly Asp 
155 160 165 

50 CCA GAA AAT ATT ACA ATT GGT GGT GTT TCT GCT GGT GGA GCA 54 8 

Pro Glu Asn He Thr He Gly Gly Val Ser Ala Gly Gly Ala 
170 175 180 
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AGT GTT 
Ser Val 



TAC AAA 
5 Tyr Lys 



TGG GCC 
Trp Ala 

10 

GCT GAG 
Ala Glu 
225 

15 

TTA GAA 
Leu Glu 
240 

20 AAA ATG 
Lys Met 



CAT TAT CTT 
His Tyr Leu 
185 

AGG GCA ATT 
Arg Ala lie 
200 

TTC CAA AGA 
Phe Gin Arg 
215 

ATA TTA GGT 
lie Leu Gly 



TTC TTA CAA 
Phe Leu Gin 



CCA GCT GAA 
Pro Ala Glu 
255 



TTA TTG TCA 
Leu Leu Ser 
190 

GCT CAA AGT 
Ala Gin Ser 



CAT CCA GTA 
His Pro Val 



CAT CCC ACA 
His Pro Thr 
230 

AAA GCC CCA 
Lys Ala Pro 
245 

ACA GAA GGT 
Thr Glu Gly 
260 



CAT ACA ACC 
His Thr Thr 



GGA AGT GCT 
Gly Ser Ala 
205 

AAG CGT AGT 
Lys Arg Ser 
220 

AAC AAC ACT 
Asn Asn Thr 
235 

GTA GAC AGT 
Val Asp Ser 



GAA ATA ATA 
Glu lie lie 



ACT GGA CTT 
Thr Gly Leu 
195 

TTA AAT CCA 
Leu Asn Pro 
210 

CTT CAA CTT 
Leu Gin Leu 



CAA GAT GCT 
Gin Asp Ala 

CTC CTG AAA 
Leu Leu Lys 
250 

GAA GAG TTC 
Glu Glu Phe 
265 



GTC TTC GTA CCA TCA ATT GAA AAA GTT TTC CCA TCC CAC CAA 
25 Val Phe Val Pro Ser lie Glu Lys Val Phe Pro Ser His Gin 
270 275 280 



CCT TTC 
Pro Phe 

30 

TCC TTT 
Ser Phe 
295 

35 

GAA GGA 
Glu Gly 
310 

40 ATG CTG 
Met Leu 



TTG GAA GAA 
Leu Glu Glu 
285 

AAC AAA GTA 
Asn Lys Val 



CTT TTG TAC 
Leu Leu Tyr 



AAC CAA GCT 
Asn Gin Ala 
325 



TCA CCA TTG 
Ser Pro Leu 



CCT TTA TTA 
Pro Leu Leu 
300 

AAA TTC TTT 
Lys Phe Phe 
315 

GAA GCA GAT 
Glu Ala Asp 
330 



GCC AGA ATG 
Ala Arg Met 
290 

GTT GGA TTC 
Val Gly Phe 
305 

ATG AAA GAA 
Met Lys Glu 



TTC GAA AGA 
Phe Glu Arg 



AAA TCT GGA 
Lys Ser Gly 



AAC AGC GCA 
Asn Ser Ala 



AAA CCA GAG 
Lys Pro Glu 
320 

CTC GTA CCA 
Leu Val Pro 
335 



GCC GAA TTT GAA TTA GCC CAT GGA TCA GAA GAA TCG AAA AAA 
45 Ala Glu Phe Glu Leu Ala His Gly Ser Glu Glu Ser Lys Lys 
340 345 350 

CTT GCA GAA AAA ATC AGG AAG TTT TAC TTT GAC GAT AAA CCC 
Leu Ala Glu Lys lie Arg Lys Phe Tyr Phe Asp Asp Lys Pro 
50 355 360 



FC-1-C1-PCT 

590 



632 



674 



716 



758 



800 



842 



884 



926 



968 



1010 



1052 



1094 
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GTT CCT 
Val Pro 
365 

5 ATT TGG 
lie Trp 
380 

GTA GAA 
10 Val Glu 



TTC TCT 
Phe Ser 

15 

AAC TTG 
Asn Leu 

20 

TTC AAA 
Phe Lys 
435 

25 GTT TTA 
Val Leu 
450 



GAA AAT GAG 
Glu Asn Glu 



TTT ACT AGA 
Phe Thr Arg 



AAA CAA GAC 
Lys Gin Asp 
395 

GAA AGT CAT 
Glu Ser His 
410 

ACT GGA GCA 
Thr Gly Ala 
425 

GTC GAG ATG 
Val Glu Met 



TTA ACA AAA 
Leu Thr Lys 



CAG AAA TTT 
Gin Lys Phe 
370 

GGC ATT GAC 
Gly lie Asp 
385 

GAG CCA GTA 
Glu Pro Val 
400 

CCT GCA AAA 
Pro Ala Lys 



TGT CAT GGT 
Cys His Gly 



ATG AAG CTG 
Met Lys Leu 
440 

GAT AGG GTA 
Asp Arg Val 
455 



ATT GAC TTG 
lie Asp Leu 
375 

AAG CAT GTC 
Lys His Val 



TAT TAT TAT 
Tyr Tyr Tyr 



GGA ACA TTT 
Gly Thr Phe 
415 

GAA GAA CTT 
Glu Glu Leu 
430 

GAA AAA GAT 
Glu Lys Asp 
445 

CTT GCT ATG 
Leu Ala Met 



ATA GGA GAT 
lie Gly Asp 

AAG TTG TCT 
Lys Leu Ser 
390 

GAA TAT TCT 
Glu Tyr Ser 
405 

GGT GAC CAT 
Gly Asp His 
420 

GTG AAT TTA 
Val Asn Leu 



AAA CCG AAT 
Lys Pro Asn 



TGG ACG AAC 
Trp Thr Asn 
460 



TTC ATC AAA AAT GGA AAT CCT ACT CCT GAA GTA ACT GAA TTA 
30 Phe lie Lys Asn Gly Asn Pro Thr Pro Glu Val Thr Glu Leu 
465 470 475 

TTG CCA GTT AAA TGG GAA CCT GCC ACA AAA GAC AAG TTG AAT 
Leu Pro Val Lys Trp Glu Pro Ala Thr Lys Asp Lys Leu Asn 
35 480 485 490 



TAT TTG AAC ATT GAT G 
Tyr Leu Asn lie Asp 
495 
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1136 
1178 
1220 
1262 
1304 
1346 
1388 
1430 
1472 
1488 
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While various embodiments of the present invention have been described in 
detail, it is apparent that modifications and adaptations of those embodiments will occur 
to those skilled in the art. It is to be expressly understood, however, that such 
modifications and adaptations are within the scope of the present invention, as set forth 
in the following claims. 



